Real-time Analytics News for the Week Ending June 8

In this week’s real-time analytics news: The Snowflake Data Cloud Summit 2024 produced dozens of company and partner announcements.

Keeping pace with news and developments in the real-time analytics and AI market can be a daunting task. Fortunately, we have you covered with a summary of the items our staff comes across each week. And if you prefer it in your inbox, sign up here!

Snowflake made a number of announcements at this week’s Snowflake Data Cloud Summit 2024. To start, it announced new features and enhancements to Snowflake Cortex AI that offer users easy, efficient, and trusted ways to create AI-powered applications. In addition, Snowflake is further democratizing how any user can customize AI for specific industry use cases through a new no-code interactive interface, access to industry-leading large language models (LLMs), and serverless fine-tuning.

Snowflake is also making it easier to operationalize models with an integrated experience for machine learning (ML) through Snowflake ML, enabling developers to build, discover, and govern models and features across the ML lifecycle.

Other Snowflake announcements from the summit include:

The announcement of Polaris Catalog, a vendor-neutral, open catalog implementation for Apache Iceberg. Polaris Catalog will be open sourced in the next 90 days to provide enterprises and the entire Iceberg community with new levels of choice, flexibility, and control over their data, with full enterprise security and Apache Iceberg interoperability with Amazon Web Services (AWS), Confluent, Dremio, Google Cloud, Microsoft Azure, Salesforce, and more.

The company is advancing Snowflake Horizon, Snowflake’s built-in governance and discovery solution. As a part of Snowflake Horizon’s new capabilities, the Internal Marketplace (private preview) allows users to curate and publish data products such as data, models, and applications specifically for teams within their organization to discover and use — while preventing unintended sharing to external parties.

Snowflake announced Snowflake Notebooks (now in public preview), which is natively integrated with the full breadth of the Snowflake platform, including Snowpark ML, Streamlit, and Snowflake Cortex AI. Snowflake Notebooks provides a single, easy-to-use development interface for Python, SQL, and Markdown. Developers can also leverage Snowflake Notebooks to experiment and iterate on their ML pipelines, harness AI-powered editing features, simplify data engineering workflows, and more to unlock increased productivity and collaborative development.

The company also announced a new collaboration with NVIDIA that aims to help users build customized AI data applications in Snowflake, powered by NVIDIA AI. With this latest collaboration, Snowflake has adopted NVIDIA AI Enterprise software to integrate NeMo Retriever microservices into Snowflake Cortex AI. In addition, Snowflake Arctic, the open, enterprise-grade LLM, is now fully supported with NVIDIA TensorRT-LLM software, providing users with highly optimized performance.

Partner announcements at the Snowflake Data Cloud Summit 2024

Airbyte announced the launch of its Snowflake Cortex destination, a new data connector that is the first for Snowflake users who are interested in building Generative AI (GenAI) capabilities directly within their existing Snowflake accounts.

Alation announced a deepened integration with Snowflake Horizon to increase data quality and deliver AI-ready data for organizations. Alation’s Open Data Quality Framework now supports integration with Snowflake Horizon, enabling organizations to proactively detect and address data quality issues before they impact business or data operations, facilitating improved governance, faster and more accurate AI model development, and trusted decision-making across the enterprise.

AtScale announced the launch of AtScale’s Semantic Layer Platform on Snowflake Marketplace. AtScale’s Snowflake Native App provides consistent and governed semantic data definitions for self-service BI and AI initiatives natively within joint customers’ Snowflake accounts. The implementation of AtScale’s Semantic Layer enables Snowflake customers to deliver KPI consistency across their organization, enable OLAP-style analytics, and deliver low-latency BI performance.

Census announced the launch of Live Syncs, one of the fastest-ever data activation solutions for Snowflake. Developed in partnership with Snowflake’s product team, Census Live Syncs offers near real-time CDP capabilities on top of the AI Data Cloud. This Composable CDP approach transforms the AI Data Cloud infrastructure that companies already have into an engine that drives business growth and revenue, delivering huge cost savings and data-driven decisions without complex engineering.

Coalesce announced the launch of its new Coalesce Marketplace. Currently available in private preview, Coalesce Marketplace offers over 30 certified, ready-to-use code collections, known as Packages, to expedite the development and delivery of Snowflake data projects. Every Package on Coalesce Marketplace is certified to meet quality and performance standards.

Fivetran announced an expanded partnership with Snowflake, including Iceberg Table Support and a commitment to build Native Connectors for Snowflake. Fivetran’s support of Iceberg Tables provides Snowflake customers with the ability to create a lakehouse architecture with Apache Iceberg, all within the Snowflake AI Data Cloud. Further deepening the partnership, Fivetran also announced the development of Native Apps. This development will provide a faster way for Snowflake customers to leverage Fivetran’s connectors directly from the Snowflake Marketplace to replicate data into the AI Data Cloud.

H2O.ai announced that it has launched native H2O ML and Generative AI Apps in Snowflake Marketplace. H2O.ai’s Snowflake Native Apps provide seamless integrated workflows, natively within joint customers’ Snowflake accounts. Three H2O ML and GenAI Bundles are now available for Snowflake users: H2O Predictive Modeling Starter Pack, H2O GenAI LLM Starter Pack, and H2O Machine Learning Starter Pack.

Immuta announced granular data policy support for Snowflake Iceberg Tables. The Immuta policy engine now supports full row- and column-level security on Snowflake-managed and unmanaged Iceberg tables, ensuring consistent native policy enforcement and continuous monitoring with unified audit capabilities across data platforms.

Informatica announced Native SQL ELT support for Snowflake Cortex AI Functions and the launch of Enterprise Data Integrator (EDI) and Cloud Data Access Management (CDAM) for Snowflake. These new offerings will enable organizations to develop Generative AI (GenAI) applications, establish streamlined data integration, and provide centralized, policy-based access management, simplifying data governance and ensuring control over data usage.

Kumo AI announced it has launched Kumo as a Snowflake Native App in Snowflake Marketplace in a private preview. Kumo helps data scientists build highly accurate machine learning (ML) models that better predict user and customer behavior by combining graph learning over enterprise data with generative AI models trained on public data. Kumo’s Snowflake Native App is available through Snowflake Marketplace, can be paid for with Snowflake Capacity commitment for eligible customers, and is built using the Snowflake Native App Framework.

Matillion announced that it is bringing no-code Generative AI (GenAI) to Snowflake users with new GenAI capabilities and integrations with Snowflake Cortex AI, Snowflake ML Functions, and support for Snowpark Container Services. Matillion’s graphical components make it easy and near instant to incorporate GenAI into data workflows, regardless of technical expertise. To do this, Matillion abstracts SQL functions that run in Cortex AI’s fully managed infrastructure for LLMs.

Qlik announced expanded capabilities with the Snowflake Data Cloud through its adoption of Snowflake Cortex AI. Qlik now enables customers to harness the full potential of Cortex AI for advanced AI-driven analytics. Qlik’s approach allows enterprises to embed deep learning into their data workflows, enhancing decision-making and operational efficiency with capabilities such as real-time predictive analytics and intelligent data insights. Additionally, Qlik’s integration now optimizes SAP data management with Snowflake, enhancing data ingestion, transformation, and analytics.

RelationalAI announced the public availability of its Snowflake Native App on Snowflake Marketplace. Built with the Snowflake Native App Framework and Snowpark Container Services, RelationalAI enables customers to combine knowledge graphs with GenAI to get more value from their data and make better decisions.

Sigma announced that it has launched two new Snowflake Native Apps: the Sigma Data Path Snowflake Native App and the Python in Sigma Snowflake Native App. The Sigma Data Path Snowflake Native App is a connection source in Sigma and can host many of the core microservices that make up Sigma (data path) in Snowpark Container Services. With the Python in Sigma Snowflake Native App, Python users can produce dynamic applications for the business that they can access, engage with, and iterate on.

SnapLogic announced new connectivity and support for Snowflake vector data types, Snowflake Cortex, and Streamlit to help companies modernize their businesses and accelerate the creation of generative AI applications. Now customers can leverage SnapLogic’s ability to integrate business critical information into Snowflake’s high-performance cloud-based data warehouse to build and deploy LLM applications.

Snowplow announced the launch of AI Agent Event Collection and Analytics, powered by Snowflake. The new application will enable brands to better understand the impact of Generative AI investments. Building AI Agent Event Collection and Analytics on Snowflake’s AI Data Cloud has allowed Snowplow to use Snowflake Cortex AI to implement uplift modeling with LLM events and measure the incremental benefit of LLM agents on customer conversion. This data allows organizations in e-commerce, media, financial services, and other industries investing in GenAI programs to effectively calculate ROI and identify optimizations.

Striim announced the private preview launch of a new suite of connectors for Snowflake Change Data Capture (CDC). The connectors can ingest analytical data from Snowflake in near real-time, allowing users to operationalize Snowflake data in business applications. The new CDC Reader for Snowflake solution will become generally available later this year.

Tonic.ai announced that it has launched the Tonic Textual Native App, Powered by Snowflake, in Snowflake Marketplace. Tonic.ai’s Snowflake Native App provides seamless and secure data redaction, tokenization and synthesis of unstructured free-text data, natively within joint customers’ Snowflake accounts. As a Snowflake Native App, Tonic Textual enables customers to secure unstructured Snowflake data for fine-tuning LLMs, building retrieval augmented generation (RAG) systems, and other jobs within their Snowflake account where their data already resides.

Other real-time analytics news in brief

Acante announced the launch of Acante Data Access Observer. The new solution helps Databricks users save time and frees them up to focus on innovation. Specifically, Acante Data Access Observer offers Databricks users automated and consolidated visibility into all data accesses within their environments. By leveraging this solution, customers can effortlessly identify sensitive data such as PII, PCI, and PHI across any data catalog.

Cirata announced the release of Cirata Data Migrator 2.5, which now includes native integration with the Databricks Unity Catalog. Expanding the Cirata and Databricks partnership, the new integration centralizes data governance and access control capabilities to enable faster data operations and accelerated time-to-business-value for enterprises. 

Dataloop announced its integration with NVIDIA NIM inference microservices. The integration will let Dataloop users benefit from enhanced security and control of generative AI applications and data with self-hosted deployment of the latest AI models in their choice of infrastructure, whether on-premises or in the cloud. It will also help accelerate the adoption and distribution of generative AI by providing Dataloop users with a seamless way to deploy NVIDIA-optimized and accelerated models, simplifying the process of building innovative AI applications.

Cisco announced an AI cluster solution with NVIDIA. The Cisco Nexus HyperFabric AI cluster solution combines Cisco AI-native networking with NVIDIA accelerated computing and AI software, as well as a VAST Data data store. It is designed to enable customers to focus on AI-driven innovation and new revenue opportunities rather than IT management.

In other Cisco news, the company unveiled the first of its integrations with Splunk, a Cisco company, as the organizations combine their observability technologies to accelerate full-stack observability for the entire enterprise. Through the full-stack observability journey, organizations can unlock unified visibility across any environment (on-premises, hybrid, and multi-cloud) and any stack while harnessing powerful real-time analytics for faster, more accurate detection, investigation, and response.

Databricks announced new and expanded strategic partnerships for data sharing and collaboration with industry-leading partners, including Acxiom, Atlassian, Epsilon, HealthVerity, LiveRamp, S&P Global, Shutterstock, T-Mobile, Tableau, TetraScience, and The Trade Desk. Databricks’ investments in its open data and AI ecosystem enable customers to collaborate across platforms and unlock AI innovation.

Fivetran announced the general availability of the Fivetran Managed Data Lake Service. The Fivetran Managed Data Lake Service simplifies data lake management by automatically converting customer data to popular open formats (i.e., Apache Iceberg or Delta Lake) before landing it in the data lake. When combined with Fivetran’s ongoing table management and maintenance, customers get the easy queryability and ease of use of a cloud data warehouse, with the flexibility and scale of a data lake.

Hammerspace announced that its Global Data Platform can now be used to process, store, and orchestrate data in edge compute environments. The solution combines Gryf, co-designed by SourceCode and GigaIO, with Hammerspace’s Global Data Platform to bring unprecedented levels of shared AI processing power to any location, including those operating with limited or no networking infrastructure.

Immuta announced new data governance and audit capabilities for Retrieval Augmented Generation (RAG)–based GenAI solutions across multiple cloud platforms. With this release, Immuta offers a multi-layer architecture for securing, monitoring, and auditing sensitive data accessed by RAG-based AI applications. With the Immuta GenAI solution, data teams can control access to the storage layer, maintain a highly accurate and granular metadata inventory of RAG indexes, control access to RAG-based applications, and monitor and audit RAG index access.

Jitterbit provided previews of three new AI product capabilities the company will beta this year. The three upcoming AI-powered product betas include App Builder AI Assistant, which leverages AI to build or modify an application using natural language in a chatbot interface; Connector AI Assistant, which uses an AI-powered chatbot to build an iPaaS connector for an endpoint that has a REST API; and AskJB, which is an in-product AI chatbot that fields natural language queries of Jitterbit documentation and delivers precise and concise responses.

Lenovo announced the Lenovo ThinkSystem V4 portfolio of Intel-based solutions powered by Intel Xeon 6 processors, which are designed to make AI accessible while flexibly matching the specific workload needs of any business. The portfolio includes new AI-enabled solutions that ensure the right mix of AI is available to help customers seamlessly integrate AI into their workflows with new servers that are purpose-built and optimized to maximize performance and efficiency for targeted workloads.

Ontotext announced the latest version of Ontotext Metadata Studio (OMDS), a tool designed for knowledge graph enrichment through text analytics of unstructured documents. Version 3.8 aids in the creation, evaluation, and quality improvement of text analytics services.

Prophecy announced Prophecy Data Transformation Copilot for Databricks, a copilot that accelerates the preparation of raw data for analytics and AI applications. By leveraging generative AI, Prophecy Data Transformation Copilot expedites the development, deployment, and monitoring of enterprise-grade data pipelines native to the Databricks Data Intelligence Platform, ensuring the delivery of clean, trusted, and timely data for analytics.

Qlik has signed a Strategic Collaboration Agreement (SCA) with Amazon Web Services (AWS) to help simplify the path towards reliable and trusted enterprise AI. Together, the companies will combine Qlik’s solutions with AWS cloud and generative AI technologies. The work will help organizations drive AI app development, enhance SAP data utilization, streamline data compliance, and accelerate AI technology adoption.

In other Qlik news, the company unveiled Qlik Talend Cloud and Qlik Answers. These new solutions will accelerate enterprise AI adoption, with Qlik Talend Cloud delivering no-code to pro-code, AI-augmented data integration capabilities, and Qlik Answers offering AI-generated answers with full explainability from unstructured data.

VAST Data announced it is collaborating with Cisco and NVIDIA on a solution to offer businesses simple, high-performance AI infrastructure coupled with an Ethernet-based AI fabric to handle large volumes of data at exabyte scale. Through this new relationship, the VAST Data Platform became the first data platform optimized and supported by Cisco Nexus HyperFabric.

If your company has real-time analytics news, send your announcements to [email protected].

Salvatore Salamone

About Salvatore Salamone

Salvatore Salamone is a physicist by training who has been writing about science and information technology for more than 30 years. During that time, he has been a senior or executive editor at many industry-leading publications including High Technology, Network World, Byte Magazine, Data Communications, LAN Times, InternetWeek, Bio-IT World, and Lightwave, The Journal of Fiber Optics. He also is the author of three business technology books.
