Interesting

AI tools show limitations in diagnosing atypical emergency room cases

Artificial intelligence tools can assist emergency room physicians in accurately predicting disease but only for patients with typical symptoms, West Virginia University scientists have found.

Gangqing "Michael" Hu, assistant professor in the WVU School of Medicine Department of Microbiology, Immunology and Cell Biology and director of the WVU Bioinformatics Core facility, led a study that compared the precision and accuracy of four ChatGPT models in making medical diagnoses and explaining their reasoning.

His findings, published in the journal Scientific Reports, demonstrate the need for incorporating greater amounts of different types of data in training AI technology to assist in disease diagnosis.

More data can make the difference in whether AI gives patients the correct diagnoses for what are called "challenging cases," which don't exhibit classic symptoms. As an example, Hu pointed to a trio of scenarios from his study involving patients who had pneumonia without the typical fever.

In these three cases, all of the GPT models failed to give an accurate diagnosis. That made us dive in to look at the physicians' notes and we noticed the pattern of these being challenging cases. ChatGPT tends to get a lot of information from different resources on the internet, but these may not cover atypical disease presentation." 

Gangqing "Michael" Hu, Assistant Professor, WVU School of Medicine Department of Microbiology, Immunology and Cell Biology 

The study analyzed data from 30 public emergency department cases, which for reasons of privacy did not include demographics.

Hu explained that in using ChatGPT to assist with diagnosis, physicians' notes are uploaded, and the tool is asked to provide its top three diagnoses. Results varied for the versions Hu tested: the GPT-3.5, GPT-4, GPT-4o and o1 series.

"When we looked at whether the AI models gave the correct diagnosis in any of their top three results, we didn't see a significant improvement between the new version and the older version," he said. "But when we look at each model's number one diagnosis, the new version is about 15% to 20% higher in accuracy than the older version."

Given AI models' current low performance on complex and atypical cases, Hu said human oversight is a necessity for high-quality, patient-centered care when using AI as an assistive tool.

"We didn't do this study out of curiosity to see if the new model will give better results. We wanted to establish a basis for future studies that involve additional input," Hu said. "Currently, we input physician notes only. In the future we want to improve the accuracy by including images and findings from laboratory tests."

Hu also plans to expand on findings from one of his recent studies in which he applied the ChatGPT-4 model to the task of role playing a physiotherapist, psychologist, nutritionist, artificial intelligence expert and athlete in a simulated panel discussion about sports rehabilitation. 

He said he believes a model like that can improve AI's diagnostic accuracy by taking a conversational approach in which multiple AI agents interact.

"From a position of trust, I think it's very important to see the reasoning steps," Hu said. "In this case, high-quality data including both typical and atypical cases helps build the trust."

Hu emphasized that while ChatGPT is promising, it is not a certified medical device. He said if health care providers were to include images or other data in a clinical setting, the AI model would be an open-source system and installed in a hospital cluster to comply with privacy laws.

Other contributors to the study were Jinge Wang, a postdoctoral fellow, and Kenneth Shue, a lab volunteer from Montgomery County, Maryland, both in the School of Medicine Department of Microbiology, Immunology and Cell Biology; as well as Li Liu, Arizona State University. The work was supported by funding from the National Institutes of Health and National Science Foundation.

Hu said future research on using ChatGPT in emergency departments could examine whether enhancing AIs' abilities to explain their reasoning could contribute to triage or decisions about patient treatment.

Source:

West Virginia University

Journal reference:

Wang, J., et al. (2025). Preliminary evaluation of ChatGPT model iterations in emergency department diagnostics. Scientific Reports. doi.org/10.1038/s41598-025-95233-1.


Source: http://www.news-medical.net/news/20250523/AI-tools-show-limitations-in-diagnosing-atypical-emergency-room-cases.aspx

Inline Feedbacks
View all comments
guest

FOXP4 gene variants reveal new genetic link to long COVID risk

A landmark study uncovers how a specific lung gene, FOXP4, raises the risk of persistent symptoms after COVID-19,...

Confocal microscopy may help identify biomarkers for chemotherapy-induced neuropathy

A University of Arizona Comprehensive Cancer Center researcher received a $2.4 million National Cancer Institute grant to develop a noninvasive, confocal microscope...

ESMO releases updated scale to measure clinical benefit of cancer treatments

The European Society for Medical Oncology (ESMO) is pleased to announce the publication of the latest version of...

Study reveals continuing and worrying trend in excess US deaths

There were over 1.5 million "missing Americans" in 2022 and 2023, deaths that would have been averted if...

Targeting individual frailty traits may prevent falls among the elderly

A new research paper was published in Aging (Aging-US) Volume 17, Issue 4, on April 1, 2025, titled "Examining frailty...

Brain stem nerve cells hold key to safer weight loss treatments

A specific group of nerve cells in the brain stem appears to control how semaglutide affects appetite and...

Wayne State research team tracks effects of bullying from high school to college

With funding from the Spencer Foundation, a private foundation focused on funding education studies, a Wayne State University...

Air pollution’s chemical punch alters immune markers in pregnant women, study finds

New research reveals that it’s not just the amount, but the oxidative power of air pollution that shifts...

Long-term study confirms safety and effectiveness of rivaroxaban for children

Venous thromboembolism (VTE) is a life-threatening complication in children with serious underlying conditions such as heart defects or...

Molecular Devices launches automated QPix FLEX Microbial Colony Picking System

Molecular Devices, LLC., a leading high-performance life science solutions provider, today launched the QPix® FLEX™ Microbial Colony Picking System....

Social connection remains an overlooked health factor, research shows

Research confirms that social isolation and loneliness significantly impact health and mortality, even if not listed on death...

Oral microbiota transmission linked to shared depression and anxiety in couples

Background and objectives Oral microbiota dysbiosis and altered salivary cortisol levels have been linked to depression and anxiety....

Loss of automatic reenrollment leads to drop in health insurance coverage

Researchers from the University of Pittsburgh, University of South Carolina and Emory University have published findings in JAMA...

Detecting balance impairments early could prevent life-threatening falls

As we get older, our bodies stop performing as they once did. We aren't as strong as we...

Sartorius octet® r8e: Revolutionizing biomolecular research

The life science group Sartorius launches the new Octet® R8e biolayer interferometry (BLI) system, providing researchers with its...

Cutting back on sugary drinks may protect men’s fertility, review finds

Emerging evidence links regular sugary drink intake to impaired sperm quality and DNA damage. Find out why experts...

Integrating phytomedicine and nanotechnology in managing COVID-19 related heart disease

Acute coronary syndrome (ACS) in patients with SARS-CoV-2 infection represents a critical intersection of viral-induced inflammation and cardiovascular...

Trump’s team cited safety in limiting covid shots. patients, health advocates see more risk.

Larry Saltzman has blood cancer. He's also a retired doctor, so he knows getting covid-19 could be dangerous...

NUS researchers develop breakthrough gene delivery technology for immune cells

Researchers at the National University of Singapore (NUS) have developed a scalable, non-viral technology that efficiently delivers genetic...

Advancing GPCR Drug Discovery with Fragment Screening

Thought LeadersEdoardo FabiniPrincipal Scientist Evotec U.K. G-protein-coupled receptors (GPCRs) play a pivotal role in cellular signaling and have long...