SHARE
Facebook X Pinterest WhatsApp

Databricks Contributes To Three Open Source Projects

thumbnail
Databricks Contributes To Three Open Source Projects

At its recent summit, Databricks announced contributions to three open source projects: Delta Lake, MLflow, and Apache Spark.

Written By
thumbnail
David Curry
David Curry
Jul 20, 2022

Databricks announced at the Data & AI Summit, organized by the company, new contributions to open-source projects Delta Lake, MLflow, and Apache Spark. 

With the launch of Delta Lake 2.0, Databricks will be passing it over the Linux Foundation and open-sourcing all the APIs associated with the release. Several competitors had complained about Delta Lake’s status, whether it was open source or proprietary, and Databricks says this move should allay these complaints. 

SEE ALSO: Is the Data Cloud Alliance for Data Openness or for Google?

Delta Lake has 6,400 members with contributing developers from over 90 organizations. Contributor strength increased by 60 percent over the past year, and average lines of code commit were up 900 percent year-on-year. 

MLflow 2.0 offers developers with faster execution at scale and less time to production through standardization, with production ready templates for data scientists to access without the need for production engineers. 

The introduction of Spark Connect for Apache Spark aims to provide better stability and allow for remote connectivity with Spark from any device. Databricks also announced Project Lightspeed, the next generation of Spark streaming engine.

“From the beginning, Databricks has been committed to open standards and the open source community. We have created, contributed to, fostered the growth of, and donated some of the most impactful innovations in modern open source technology,” said Ali Ghodsi, co-Founder and CEO of Databricks. “Open data lakehouses are quickly becoming the standard for how the most innovative companies handle their data and AI. Delta Lake, MLflow and Spark are all core to this architectural transformation, and we’re proud to do our part in accelerating their innovation and adoption.”

The Delta Lake 2.0 Release Candidate is expected to be fully released later in the year. 

thumbnail
David Curry

David is a technology writer with several years experience covering all aspects of IoT, from technology to networks to security.

Recommended for you...

Data Immediacy’s Next Step
Smart Talk Episode 9: Apache Iceberg and Streaming Data Architectures
Smart Talk Episode 5: Disaggregation of the Observability Stack
Smart Talk Episode 4: Real-Time Data and Vector Databases

Featured Resources from Cloud Data Insights

The Difficult Reality of Implementing Zero Trust Networking
Misbah Rehman
Jan 6, 2026
Cloud Evolution 2026: Strategic Imperatives for Chief Data Officers
Why Network Services Need Automation
The Shared Responsibility Model and Its Impact on Your Security Posture
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.