OpenAI claims new ChatGPT model outperforms physicians on clinical tasks


OpenAI has launched ChatGPT for Clinicians, a free AI tool designed specifically to support everyday tasks associated with medical practice.

The company says its new benchmark accompanying the tool, HealthBench Professional, shows its model outperforming human physicians on clinical tasks, The Decoder reported April 23. Here are seven things to know about the benchmark and new ChatGPT model:

1. HealthBench Professional measures AI performance across three clinical areas: consultations, writing and documentation, and medical research. It builds on the earlier HealthBench and uses physician-written conversations, multi-level physician scoring and targeted data filtering.

2. About one-third of the examples in the AI benchmark come from targeted “red teaming,” in which physicians actively tried to find weaknesses in the models. The hardest conversations were overrepresented by a factor of 3.5.

3. GPT-5.4 running in the ChatGPT for Clinicians workspace scored 59.0 overall on HealthBench Professional. Physician-written responses came in at 43.7, even with unlimited time and internet access.   

4. Every other model tested scored well below the Clinicians version: the base GPT-5.4 hit 48.1, Anthropic’s Claude Opus 4.7 reached 47.0, Google’s Gemini 3.1 Pro scored 43.8 and xAI’s Grok 4.2 landed at 36.1. 

5. GPT-5.4 in the Clinicians workspace scores about 11 points higher than the base GPT-5.4 (59.0 vs. 48.1).  

6. The report notes that OpenAI built the benchmark and tested its own models on it, which presents a potential methodological conflict of interest.

7. OpenAI says ChatGPT for Clinicians was developed with hundreds of medical advisors. Before launch, doctors tested 6,924 conversations in their everyday clinical work, and 99.6 percent of the responses were rated safe and accurate. 

