Databricksto acquire Arcion

Arcion’s connectors will simplify and accelerate ingesting data from enterprise databases to the Databricks Lakehouse Platform.

  • Monday, 6th November 2023 Posted 5 months ago in by Phil Alsop

Databricks has agreed to acquire Arcion, a Databricks Ventures portfolio company that helps enterprises quickly and reliably replicate data across on-prem, cloud databases and data platforms. This will enable Databricks to provide native solutions to ingest data from various databases and SaaS applications into the Databricks Lakehouse Platform. The transaction is valued at over $100 million, inclusive of incentives.

Data Lakehouse Platforms have emerged as the de facto standard for enterprise data and AI platforms. However, these data platforms are only as valuable as the data in them. Ingesting data from existing databases and applications remains complicated, fragile, and costly. Troves of important data sit not only in transactional databases such as Oracle, MySQL, and Postgres, but also in SaaS applications such as Salesforce, SAP, and Workday. According to a recent MIT Technology Review Insights and Databricks survey of senior data and technology executives (“Laying the foundation for data- and AI-led growth”), businesses still suffer from many siloed systems; 34% have 10+ systems, and of the largest companies, more than 80% have 10+ systems to juggle.

This acquisition will enable Databricks to natively provide a scalable, easy-to-use, and cost-effective solution to ingest data from various enterprise data sources. Building on a scalable change data capture (CDC) engine, Arcion offers connectors for over 20 enterprise databases and data warehouses. The integration will simplify ingesting such data either continuously or on-demand into the lakehouse, fully integrated with the enterprise security, governance, and compliance capabilities of the Databricks platform.

"To build analytical dashboards, data applications, and AI models, data needs to be replicated from the systems of record like CRM, ERP, and enterprise apps to the Lakehouse,” said Ali Ghodsi, Co-Founder and CEO at Databricks. “Arcion’s highly reliable and easy-to-use solution will enable our customers to make that data available almost instantly for faster and more informed decision-making. Arcion will be a great asset to Databricks, and we are excited to welcome the team and work with them to further develop solutions to help our customers accelerate their data and AI journeys.”

”Arcion’s real-time, large-scale CDC data pipeline technology extends Databricks' market-leading ETL solution to include replication of operational data in real-time,” said Gary Hagmueller, CEO of Arcion. “Databricks has been a great partner and investor in Arcion, and we are very excited to join forces to help companies simplify and accelerate their data and AI business momentum.”