SHARE
Facebook X Pinterest WhatsApp

Databricks Contributes To Three Open Source Projects

thumbnail
Databricks Contributes To Three Open Source Projects

At its recent summit, Databricks announced contributions to three open source projects: Delta Lake, MLflow, and Apache Spark.

Written By
thumbnail
David Curry
David Curry
Jul 20, 2022

Databricks announced at the Data & AI Summit, organized by the company, new contributions to open-source projects Delta Lake, MLflow, and Apache Spark. 

With the launch of Delta Lake 2.0, Databricks will be passing it over the Linux Foundation and open-sourcing all the APIs associated with the release. Several competitors had complained about Delta Lake’s status, whether it was open source or proprietary, and Databricks says this move should allay these complaints. 

SEE ALSO: Is the Data Cloud Alliance for Data Openness or for Google?

Delta Lake has 6,400 members with contributing developers from over 90 organizations. Contributor strength increased by 60 percent over the past year, and average lines of code commit were up 900 percent year-on-year. 

MLflow 2.0 offers developers with faster execution at scale and less time to production through standardization, with production ready templates for data scientists to access without the need for production engineers. 

The introduction of Spark Connect for Apache Spark aims to provide better stability and allow for remote connectivity with Spark from any device. Databricks also announced Project Lightspeed, the next generation of Spark streaming engine.

“From the beginning, Databricks has been committed to open standards and the open source community. We have created, contributed to, fostered the growth of, and donated some of the most impactful innovations in modern open source technology,” said Ali Ghodsi, co-Founder and CEO of Databricks. “Open data lakehouses are quickly becoming the standard for how the most innovative companies handle their data and AI. Delta Lake, MLflow and Spark are all core to this architectural transformation, and we’re proud to do our part in accelerating their innovation and adoption.”

The Delta Lake 2.0 Release Candidate is expected to be fully released later in the year. 

thumbnail
David Curry

David is a technology writer with several years experience covering all aspects of IoT, from technology to networks to security.

Recommended for you...

The Observability Gap AI Exposed
Tim Gasper
Jan 21, 2026
Data Immediacy’s Next Step
Smart Talk Episode 9: Apache Iceberg and Streaming Data Architectures
Smart Talk Episode 5: Disaggregation of the Observability Stack

Featured Resources from Cloud Data Insights

SAP Transformation Needs a Toolbox, Not a Hammer 
Tim Wintrip
Feb 10, 2026
AI at Scale Is an Operating Model Problem, Not a Technology One
Real-time Analytics News for the Week Ending February 7
AI as a Co-Pilot, Not a Replacement: The Ethical Path to Integrating AI into Business
Mohamed Yousuf
Feb 8, 2026
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.