SHARE
Facebook X Pinterest WhatsApp

IBM Launches Cloud Development Hub for Apache Spark

Data scientists can analyze big data quickly with the IBM Cloud Bluemix platform-based hub.

Written By
thumbnail
Sue Walsh
Sue Walsh
Jun 8, 2016

IBM has announced the creation of the first cloud-based development environment for near real-time analytics using Apache Spark. In the June 7 announcement, the company said the new hub, which they’ve named the Data Science Experience, offers 250 curated data sets, a collection of open source tools and a collaborative workspace, all designed to assist data scientists in finding and sharing meaningful insights with developers. It’s hoped that these insights will result in more rapidly developed applications.

According to IBM’s press release, the new hub builds on IBM’s $300 million investment in Apache Spark.  Billed as a type of analytics OS, the Data Science Experience will extend Spark to over two million members of the R community. This will be achieved though new contributions to SparkR, SparkSQL and Apache SparkML, the company stated. This means data scientists who work in R will have faster access to increased data, which means more insights.

“With Apache Spark, we see an opportunity to significantly transform the role of the data scientist by providing access to curated data sets, open source tools and a collaborative platform to accelerate innovation,” said Bob Picciano, senior vice president of IBM Analytics.

The open and collaborative environment will enable data scientists to bring in data and other open-source resources from IBM, Jupyter Notebooks, H2O, RStudio and many others in a single and secure environment. Curating and analyzing that data will be sped up and simplified, the announcement explained.

IBM has made over 3,000 contributions to analytics related projects in the last year, including Apache Toree, EclairJS, Apache Quarks, Apache Mesos and Apache Spark sub-projects SparkSQL, SparkR, MLLib, and PySpark.

Spark has also been baked into the core of its popular Watson IoT platform as well as their commerce, analytics and cloud platforms. In addition, they have 30 other Spark-based offerings available, the company stated in their release.

Advertisement

Why Apache Spark Is Hot

thumbnail
Sue Walsh

Sue Walsh is News Writer for RTInsights, and a freelance writer and social media manager living in New York City. Her specialties include tech, security and e-commerce. You can follow her on Twitter at @girlfridaygeek.

Recommended for you...

Open Source Talent Shortage Expected To Increase in 2022
David Curry
Jul 12, 2022
Volvo Puts IoT and AI in the Driver’s Seat for Vehicle Connectivity
Sue Walsh
Nov 6, 2020
Cybersecurity and Digital Trust Companies Team for IoT Threats Detection
Sue Walsh
Oct 12, 2020
Cornell Researchers Create the Country’s First Statewide IoT Network
Sue Walsh
Oct 9, 2020

Featured Resources from Cloud Data Insights

The Manual Migration Trap: Why 70% of Data Warehouse Modernization Projects Exceed Budget or Fail
The Difficult Reality of Implementing Zero Trust Networking
Misbah Rehman
Jan 6, 2026
Cloud Evolution 2026: Strategic Imperatives for Chief Data Officers
Why Network Services Need Automation
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.