SHARE

Real-time Analytics News for the Week Ending April 5

In this week’s real-time analytics news: MLCommons announced new results for the MLPerf Inference v6.0 benchmark suite.

Written By

Apr 5, 2026

Keeping pace with news and developments in the real-time analytics and AI market can be a daunting task. Fortunately, we have you covered with a summary of the items our staff comes across each week. And if you prefer it in your inbox, sign up here!

MLCommons announced new results for its industry-standard MLPerf Inference v6.0 benchmark suite. This release includes several important advances that ensure the benchmark suite tests current, real-world scenarios for AI deployments and delivers a comprehensive picture of AI system performance.

The new benchmark received submissions from a total of 24 participating organizations, including AMD, ASUSTeK, Cisco, CoreWeave, Dell, GATEOverflow, GigaComputing, Google, Hewlett Packard Enterprise, Intel, Inventec Corporation, KRAI, Lambda, Lenovo, MangoBoost, MiTAC, Nebius, Netweb Technologies India Limited, NVIDIA, Oracle, Quanta Cloud Technology, Red Hat, Stevens Institute of Technology, and Supermicro.

Additionally, this round recorded a new high for multi-node system submissions, a 30% increase over the Inference 5.1 benchmark six months ago. Moreover, 10% of all of the submitted systems in Inference 6.0 had more than ten nodes, compared to only 2% in the previous round. The largest system submitted in Inference 6.0 featured 72 nodes and 288 accelerators, quadrupling the number of nodes in the largest system in the previous round.

Keeping Pace with Industry Changes

Five of the eleven datacenter tests in MLPerf Inference v6.0 are new or updated, and the release also includes a new object-detection test for edge systems. The major changes include:

A new, open-weight large-language model benchmark based on GPT-OSS 120B that can be used for mathematics, scientific reasoning, and coding.
An expanded DeepSeek-R1 advanced-reasoning benchmark, including an interactive scenario that permits speculative decoding.
DLRMv3, the third generation of our recommender benchmark and now the first sequential recommendation benchmark test in the suite, is thoroughly modernized based on generous engineering contributions from Meta, a world leader in recommender systems.
The suite’s first text-to-video generation benchmark.
A new vision-language model (VLM) benchmark that transforms unstructured multi-modal data from Shopify’s extensive product Catalog into structured metadata.
An upgraded single-shot object detection benchmark for edge scenarios based on Ultralytics’ YOLOv11 Large model.

Real-time analytics news in brief

Codenotary announced the launch of AgentMon, an enterprise-grade monitor designed specifically for agentic networks, providing organizations with real-time visibility into the security, performance, and cost of AI-driven agents operating across the enterprise. The solution delivers continuous, end-to-end monitoring of agentic networks. The platform provides AI operations teams, security leaders, and compliance managers with a unified view of how agents behave, what resources they consume, and whether they are operating within defined policies.

DomainTools announced the general availability of its Model Context Protocol (MCP) server, a hosted, production-ready integration that connects AI agents and Large Language Models (LLMs) directly to DomainTools’ domain intelligence datasets. Security teams can now retrieve Risk Scores, hosting history, passive DNS records, and infrastructure connections through natural language prompts, without switching tools or writing custom queries. The solution is a hosted, fully managed service built to meet the security, reliability, and scale requirements of enterprise environments.

LUMI AI Factory launched its Dataset-as-a-Service (DaaS) solution, which brings data and compute closer together in a way that directly meets the growing needs of AI and data-intensive research. Specifically, the service brings together metadata, access rights, and data locations into a single whole, making datasets not only discoverable but immediately usable on the LUMI supercomputer. This is especially important in AI development, where training models require large volumes of data, and where the physical proximity of data to compute significantly affects performance and the reproducibility of workflows.

MindsDB launched MindsDB Anton, an autonomous, open-source BI agent that turns urgent questions into immediate, defensible answers while keeping security, oversight, and corporate governance front and center. The solution converts plain-language requests into comprehensive outputs, including tables, interactive charts, and shareable dashboards, in a single interaction. Analysts set access guardrails, validate outputs, and manage rules so that automated speed never undermines human judgment.

pgEdge announced general availability of the pgEdge MCP Server for Postgres. This full-featured and production-ready MCP server is designed for developers building Agentic AI applications in environments with strict requirements for high availability, security, data sovereignty, and global deployment. The solution works with new and existing databases running any standard version of Postgres (v14 and newer), and offers flexible deployment options, including on-premises (even air-gapped), self-managed cloud, or a managed cloud service via pgEdge Cloud.

Rafay Systems announced the general availability of Token Factory, a suite of capabilities in the Rafay Platform that delivers token-based access to AI models and services. The solution gives AI factory operators and neoclouds the metering, pricing, and access-control capabilities needed to monetize token-based access to AI models running on accelerated computing infrastructure. With Token Factory, AI factory operators can immediately deliver token-metered access to AI models as a service through developer-friendly consumption workflows without needing to build the orchestration and monetization stack from scratch.

Redpanda announced the general availability of an adaptable data streaming engine. The solution offers a single, multi-modal platform that allows enterprises to balance performance, safety, and efficiency at the topic level. Available in Redpanda Streaming 26.1, this release eliminates the need for separate, specialized clusters and provides a unified foundation for modern data and AI workloads.

Zapier announced AI Guardrails by Zapier, a set of builder-added safety checks that run directly inside automated workflows. AI Guardrails lets teams detect personally identifiable information (PII), identify prompt injection attempts, and flag toxic or harmful content before AI outputs ever touch a CRM, database, or customer inbox. Additionally, the solution embeds real-time safety checks directly into Zaps, Agents, and MCP-connected tools.

Partnerships, collaborations, and more

IBM announced a strategic collaboration with Arm to develop new dual-architecture hardware that helps enterprises run future AI and data-intensive workloads with greater flexibility, reliability, and security. Through this collaboration, IBM and Arm aim to combine IBM’s enterprise leadership in systems reliability, security, and scalability with Arm’s power-efficient architecture, workload enablement expertise, and broad software ecosystem to build flexible and scalable computing platforms for the future.

Postman announced that Claude, Anthropic’s AI model, now powers Postman’s Agent Mode. Developers can also access their Postman workspaces directly from Anthropic’s developer tools, bringing API context into their AI-powered coding workflows. Additionally, Agent Mode is an AI-native assistant built directly into Postman and powered by Claude as the default model provider on Amazon Bedrock from Amazon Web Services (AWS), giving enterprises Claude’s performance with the security, compliance, and governance controls they require.

If your company has real-time analytics news, send your announcements to ssalamone@rtinsights.com.

In case you missed it, here are our most recent weekly real-time analytics news roundups:

Salvatore Salamone

Salvatore Salamone is a physicist by training who writes about science and information technology. During his career, he has been a senior or executive editor at many industry-leading publications including High Technology, Network World, Byte Magazine, Data Communications, LAN Times, InternetWeek, Bio-IT World, and Lightwave, The Journal of Fiber Optics. He also is the author of three business technology books.

Real-time Analytics News for the Week Ending April 5

Real-time analytics news in brief

Partnerships, collaborations, and more

Salvatore Salamone

Recommended for you...

Featured Resources from Cloud Data Insights

Company

Categories