AI Chatbot Outperforms Human Clinicians in Probabilistic Diagnosis

thumbnail
AI Chatbot Outperforms Human Clinicians in Probabilistic Diagnosis

Doctor in futuristic medical concept pressing button

A study by researchers at Beth Israel Deaconess Medical Center found that when test results were positive, the AI-enabled chatbot’s diagnostic accuracy was comparable to that of human clinicians in most cases.

Feb 6, 2024

A recent study conducted by physician-investigators at Beth Israel Deaconess Medical Center (BIDMC) has compared the probabilistic reasoning abilities of a chatbot to those of human clinicians. The findings, published in JAMA Network Open, suggest that artificial intelligence (AI) has the potential to serve as a valuable clinical decision-support tool for physicians.

AI’s Role in Probabilistic Reasoning

The study’s author, Dr. Adam Rodman, emphasized the challenge humans face in probabilistic reasoning, which involves making decisions based on calculating odds. Probabilistic reasoning is a crucial component of the complex process of diagnosis. Dr. Rodman’s team chose to evaluate this aspect in isolation because it represents an area where humans could benefit from support.

The study was based on a national survey involving more than 550 practitioners who performed probabilistic reasoning on five medical cases. The researchers then used the publicly available Large Language Model (LLM) Chat GPT-4 to analyze the same cases. The chatbot estimated the likelihood of specific diagnoses based on patient presentations and updated its estimates when test results were introduced.

The study revealed that when test results were positive, the chatbot’s diagnostic accuracy was comparable to that of human clinicians in most cases. However, when test results were negative, the chatbot consistently outperformed human clinicians in all five cases. This highlights the chatbot’s ability to maintain a more accurate assessment after receiving negative test results.

See also: A Chatbot Without Personalization Has No Purpose. Here’s Why!

Advertisement

Chatbot Impact on Clinical Decision-Making

Dr. Rodman is interested in how the availability of AI support tools like chatbots might influence the performance of highly skilled physicians in clinical settings. While acknowledging that LLMs do not calculate probabilities the same way as experts, he believes that their integration into clinical workflows could lead to improved decision-making by human clinicians.

Co-authors of the study included experts from the University of Massachusetts Amherst, Harvard Medical School, and the University of Maryland School of Medicine. Grants from organizations such as the Gordon and Betty Moore Foundation and the Department of Veterans Affairs, among others, supported the research.

thumbnail
Elizabeth Wallace

Elizabeth Wallace is a Nashville-based freelance writer with a soft spot for data science and AI and a background in linguistics. She spent 13 years teaching language in higher ed and now helps startups and other organizations explain - clearly - what it is they do.

Recommended for you...

Real-time Analytics News for the Week Ending March 7
Real-time Analytics News for the Week Ending February 28
IBM’s New Acquisition Highlights Organizations Aren’t Ready for Real-Time
Max Vermeir
Feb 24, 2026
Real-time Analytics News for the Week Ending February 21

Featured Resources from Cloud Data Insights

Agentic AI and the Death of SaaS
The Business Case for a Unified Semantic Layer
Alex Merced
Mar 12, 2026
Domain-Specific LLMs: How to Make AI Useful for Your Business
Hardik Parikh
Mar 11, 2026
Engineering the Agentic Enterprise: Building Smarter, Adaptive, Autonomous Systems
Varun Goswami
Mar 10, 2026
RT Insights Logo

Analysis and market insights on real-time analytics including Big Data, the IoT, and cognitive computing. Business use cases and technologies are discussed.

Property of TechnologyAdvice. © 2026 TechnologyAdvice. All Rights Reserved

Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.