Databricks
Databricks is the most commercially successful pure-play AI and data company in history — and the clearest proof that the platform layer captures more durable enterprise value than model builders. On February 9, 2026, Databricks confirmed it had crossed $5.4 billion in annualized revenue in its Q4 (January 2026), growing more than 65% year-over-year with positive free cash flow, completing a $5 billion equity raise at a $134 billion valuation. AI products specifically generate $1.4 billion in annualized revenue — approximately 26% of total, growing faster than the base business. More than 800 customers spend over $1 million annually on Databricks. More than 70 spend over $10 million.
Founded by the creators of Apache Spark at UC Berkeley, Databricks built its platform on the idea that the separation of data warehouses and data lakes was a false dichotomy — the lakehouse architecture unifies both. In 2026, the platform extends to a complete AI stack: Mosaic AI for foundation model training and fine-tuning, MLflow for experiment tracking and model registry (the industry-standard open-source ML lifecycle tool), Unity Catalog for data and AI governance, Lakebase (serverless Postgres for AI agents), and Genie (conversational AI for any employee to chat with their data). Net dollar retention above 140% — meaning existing customers grow their spend by 40%+ year-over-year — reflects a platform that compounds value with usage.
- $5.4B ARR (Jan 2026); 65% YoY growth; $134B valuation (Series L)
- $1.4B AI products ARR — 26% of total, growing faster than platform
- 800+ customers at $1M+ ARR; 70+ at $10M+ ARR
- Mosaic AI: full stack LLM training, fine-tuning, deployment, RAG
- MLflow: industry-standard open-source ML experiment tracking
- Net dollar retention above 140% — strong expansion revenue
Databricks holds #1 by every commercially relevant metric in enterprise ML infrastructure: revenue scale, growth rate, net dollar retention, customer concentration, and strategic position. It is the company where enterprise ML workloads converge — from data engineering to model training to production deployment to AI agent infrastructure. Its IPO, when it comes, will be a defining moment for the enterprise AI software market. For technology leaders evaluating ML platform strategy, Databricks is the default consideration for any organization where data engineering and ML workflows overlap — which is nearly all of them at scale.

