SHARE
Facebook X Pinterest WhatsApp

Benchmark from NVIDIA Creates Rigorous New AI Test

thumbnail
Benchmark from NVIDIA Creates Rigorous New AI Test

New Model Technology Business Background

The new benchmark could mark new advances in computer vision and tasks, allowing companies to rely more heavily on machines for mundane or dangerous labor.

Dec 2, 2020

Computer vision tests up to this point showed machines falling far behind human evaluators in basic visual recognition. For AI, success meant correct object identification less than 70% of the time compared to human subjects at 99%. Now, a new benchmark from NIVIDA, based on a 50-year-old Russian concept, could launch a new era of artificial intelligence.

Russian computer scientist M.M. Bongard invented a collection of 100 human designed tasks designed to tease out artificial intelligence capability by providing a foundation for assessing cognition. These problems, known as Bongard Problems (BP), have been a standard for years.

See also: Businesses Outfit Cameras with AI to Prevent Coronavirus Spread

BP isn’t designed for state-of-the-art machine learning because of its small size and reliance on natural language. NVIDIA’s work aims to overcome these limitations and provide a new benchmark that better addresses state of the art machine and deep learning.

The new benchmark, called Bongard-LOGO, expands the test sets to 12,000 problem instances spanning across three different tasks:

  • Free-form shape problems: The machine must induce underlying shape problems to determine whether test images match the generated programs.
  • Basic-shape problems: Tests analogy making for features that cause problems for machines in basic shapes but aren’t an issue with free-form shapes.
  • Abstract-shape problems: Assesses the machine’s ability to discover shapes and reason. It prevents the machine from memorizing and instead forces some understanding of the underlying concept.

These three areas help us better understand what a machine is capable of recognizing beyond simple memorization. Humans can draw conclusions and make interpretations beyond the simple line and shape, finding patterns, and making analogies. Now, these tasks create a stronger benchmark for machines to do the same.

What the benchmark means for business

Computer vision is the next greatest capability for various business models that still rely heavily on human labor and intervention. A prime example is found in manufacturing. For example, manufacturing still requires humans to perform rote tasks because of the machine’s inability to assess basic shapes.

These new benchmarks could mark new advances in computer vision and tasks, allowing companies to rely more heavily on machines for mundane or dangerous labor. We’ll be watching what new advances arise from NVIDIA’s testing.

thumbnail
Elizabeth Wallace

Elizabeth Wallace is a Nashville-based freelance writer with a soft spot for data science and AI and a background in linguistics. She spent 13 years teaching language in higher ed and now helps startups and other organizations explain - clearly - what it is they do.

Recommended for you...

Why Satellite Connectivity Sits at the Heart of Enterprise Network Resilience
Fánan Henriques
Feb 14, 2026
On a Trust-Building Trajectory: AI in Network Automation
Brad Haas
Feb 12, 2026
Five Reasons Why DataOps Automation Is Now an Essential Discipline
Keith Belanger
Feb 5, 2026
Real-Time RAG Pipelines: Achieving Sub-Second Latency in Enterprise AI
Abhijit Ubale
Jan 28, 2026

Featured Resources from Cloud Data Insights

Why Satellite Connectivity Sits at the Heart of Enterprise Network Resilience
Fánan Henriques
Feb 14, 2026
Cleaning up the Slop: Will Backlash to “AI Slop” Increase This Year?
Henry Young
Feb 13, 2026
How Data Hydration Enables Scalable and Trusted AI
Peter Harris
Feb 12, 2026
On a Trust-Building Trajectory: AI in Network Automation
Brad Haas
Feb 12, 2026
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.