Thursday, March 12, 2026
This Big Influence
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop
No Result
View All Result
This Big Influence
No Result
View All Result
Home Tech

Large Language Models Struggle With Medical Coding, Study Shows

ohog5 by ohog5
April 27, 2024
in Tech
0
Large Language Models Struggle With Medical Coding, Study Shows
74
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter


You might also like

A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

How can you get rid of a phobia?

CBP Used Online Ad Data to Track Phone Locations

Artificial Intelligence Robot Thinking Desk

A research from the Icahn Faculty of Medication at Mount Sinai signifies that present massive language fashions will not be but efficient for medical coding, requiring additional growth and rigorous testing earlier than scientific implementation. Credit score: SciTechDaily.com

Analysis reveals its limitations in medical coding.

Researchers on the Icahn School of Medicine at Mount Sinai have discovered that state-of-the-art synthetic intelligence programs, particularly massive language fashions (LLMs), are poor at medical coding. Their research, not too long ago revealed within the NEJM AI, emphasizes the need for refinement and validation of those applied sciences earlier than contemplating scientific implementation.

The research extracted a listing of greater than 27,000 distinctive prognosis and process codes from 12 months of routine care within the Mount Sinai Well being System, whereas excluding identifiable affected person knowledge. Utilizing the outline for every code, the researchers prompted fashions from OpenAI, Google, and Meta to output essentially the most correct medical codes. The generated codes had been in contrast with the unique codes and errors had been analyzed for any patterns.

Evaluation of Mannequin Efficiency

The investigators reported that all the studied massive language fashions, together with GPT-4, GPT-3.5, Gemini-pro, and Llama-2-70b, confirmed restricted accuracy (under 50 %) in reproducing the unique medical codes, highlighting a major hole of their usefulness for medical coding. GPT-4 demonstrated the perfect efficiency, with the best precise match charges for ICD-9-CM (45.9 %), ICD-10-CM (33.9 %), and CPT codes (49.8 %).

GPT-4 additionally produced the best proportion of incorrectly generated codes that also conveyed the right which means. For instance, when given the ICD-9-CM description “nodular prostate with out urinary obstruction,” GPT-4 generated a code for “nodular prostate,” showcasing its comparatively nuanced understanding of medical terminology. Nevertheless, even contemplating these technically appropriate codes, an unacceptably massive variety of errors remained.

The subsequent best-performing mannequin, GPT-3.5, had the best tendency towards being imprecise. It had the best proportion of incorrectly generated codes that had been correct however extra normal in nature in comparison with the exact codes. On this case, when supplied with the ICD-9-CM description “unspecified adversarial impact of anesthesia,” GPT-3.5 generated a code for “different specified adversarial results, not elsewhere categorized.”

Significance of Rigorous AI Analysis

“Our findings underscore the crucial want for rigorous analysis and refinement earlier than deploying AI applied sciences in delicate operational areas like medical coding,” says research corresponding writer Ali Soroush, MD, MS, Assistant Professor of Knowledge-Pushed and Digital Medication (D3M), and Medication (Gastroenterology), at Icahn Mount Sinai. “Whereas AI holds nice potential, it should be approached with warning and ongoing growth to make sure its reliability and efficacy in well being care.”

One potential utility for these fashions within the healthcare trade, say the investigators, is automating the project of medical codes for reimbursement and analysis functions based mostly on scientific textual content.

“Earlier research point out that newer massive language fashions battle with numerical duties. Nevertheless, the extent of their accuracy in assigning medical codes from scientific textual content had not been completely investigated throughout totally different fashions,” says co-senior writer Eyal Klang, MD, Director of the D3M’s Generative AI Analysis Program. “Subsequently, our intention was to evaluate whether or not these fashions might successfully carry out the basic process of matching a medical code to its corresponding official textual content description.”

The research authors proposed that integrating LLMs with professional information might automate medical code extraction, doubtlessly enhancing billing accuracy and decreasing administrative prices in well being care.

Conclusion and Subsequent Steps

“This research sheds gentle on the present capabilities and challenges of AI in well being care, emphasizing the necessity for cautious consideration and extra refinement previous to widespread adoption,” says co-senior writer Girish Nadkarni, MD, MPH, Irene and Dr. Arthur M. Fishberg Professor of Medication at Icahn Mount Sinai, Director of The Charles Bronfman Institute of Customized Medication, and System Chief of D3M.

The researchers warning that the research’s synthetic process could not absolutely characterize real-world situations the place LLM efficiency could possibly be worse.

Subsequent, the analysis workforce plans to develop tailor-made LLM instruments for correct medical knowledge extraction and billing code project, aiming to enhance high quality and effectivity in healthcare operations.

Reference: “Massive Language Fashions Are Poor Medical Coders — Benchmarking of Medical Code Querying” by Ali Soroush, Benjamin S. Glicksberg, Eyal Zimlichman, Yiftach Barash, Robert Freeman, Alexander W. Charney, Girish N Nadkarni and Eyal Klang, 19 April 2024, NEJM AI.
DOI: 10.1056/AIdbp2300040

This analysis was supported by the AGA Analysis Basis’s 2023 AGA-Amgen Fellowship to-School Transition Award AGA2023-32-06 and an NIH UL1TR004419 award.





Source link

Tags: CodingLanguagelargemedicalModelsShowsstruggleStudy
Share30Tweet19
ohog5

ohog5

Recommended For You

A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

by ohog5
March 8, 2026
0
A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

Signal as much as see the long run, right now Can’t-miss improvements from the bleeding fringe of science and tech Whereas the precise influence of AI on the...

Read more

How can you get rid of a phobia?

by ohog5
March 8, 2026
0
How can you get rid of a phobia?

An skilled has solutions for you about what phobias are and how one can eliminate them. Within the Alfred Hitchcock basic movie Vertigo, the protagonist John “Scottie” Ferguson,...

Read more

CBP Used Online Ad Data to Track Phone Locations

by ohog5
March 7, 2026
0
CBP Used Online Ad Data to Track Phone Locations

America and Israel launched a war in Iran final week that has already killed greater than 1,200 Iranians and spilled out across the Middle East. There are many...

Read more

How “Empty Space” Is Supercharging Atomically Thin Semiconductors

by ohog5
March 6, 2026
0
How “Empty Space” Is Supercharging Atomically Thin Semiconductors

A single layer of atoms could seem too skinny to meaningfully work together with gentle, but supplies like tungsten disulfide are reshaping what is feasible in nanophotonics. Researchers...

Read more

Thousands of Everyday Drone Pilots Are Making a Google Street View From Above

by ohog5
March 6, 2026
0
Thousands of Everyday Drone Pilots Are Making a Google Street View From Above

Gaspard-Félix Tournachon, popularly referred to as “Nadar,” took the first known aerial photographs utilizing a digicam connected to a hot-air balloon simply outdoors Paris in 1858. Ever since,...

Read more
Next Post
These Incredibly Popular Drugs Have Been Linked to Migraines

These Incredibly Popular Drugs Have Been Linked to Migraines

Leave a Reply

Your email address will not be published. Required fields are marked *

Related News

Community shows up for young entrepreneurs at Children’s Business Fair

Community shows up for young entrepreneurs at Children’s Business Fair

October 15, 2023
New Research Reveals How HIV Outsmarts Cellular Security

New Research Reveals How HIV Outsmarts Cellular Security

March 7, 2024
Trump Hits A New Low As His Base Splits Over Jeffrey Epstein

Trump Hits A New Low As His Base Splits Over Jeffrey Epstein

July 20, 2025

Browse by Category

  • Business
  • Health
  • Politics
  • Tech
  • World

Recent News

Scientists Discover Hidden Energy Problem in the Depressed Brain

Scientists Discover Hidden Energy Problem in the Depressed Brain

March 11, 2026
How Nabla is Powering the Next Generation of Healthcare AI

How Nabla is Powering the Next Generation of Healthcare AI

March 10, 2026

CATEGORIES

  • Business
  • Health
  • Politics
  • Tech
  • World

Follow Us

Recommended

  • Scientists Discover Hidden Energy Problem in the Depressed Brain
  • How Nabla is Powering the Next Generation of Healthcare AI
  • New AI Model Predicts Cancer Spread With Incredible Accuracy
  • Sectra Acquires Oxipit to Scale Autonomous Diagnostic Imaging
No Result
View All Result
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop

© 2023 ThisBigInfluence

Cleantalk Pixel
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?