Wednesday, June 10, 2026
This Big Influence
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop
No Result
View All Result
This Big Influence
No Result
View All Result
Home Tech

Large Language Models Struggle With Medical Coding, Study Shows

ohog5 by ohog5
April 27, 2024
in Tech
0
Large Language Models Struggle With Medical Coding, Study Shows
74
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter


You might also like

A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

How can you get rid of a phobia?

CBP Used Online Ad Data to Track Phone Locations

Artificial Intelligence Robot Thinking Desk

A research from the Icahn Faculty of Medication at Mount Sinai signifies that present massive language fashions will not be but efficient for medical coding, requiring additional growth and rigorous testing earlier than scientific implementation. Credit score: SciTechDaily.com

Analysis reveals its limitations in medical coding.

Researchers on the Icahn School of Medicine at Mount Sinai have discovered that state-of-the-art synthetic intelligence programs, particularly massive language fashions (LLMs), are poor at medical coding. Their research, not too long ago revealed within the NEJM AI, emphasizes the need for refinement and validation of those applied sciences earlier than contemplating scientific implementation.

The research extracted a listing of greater than 27,000 distinctive prognosis and process codes from 12 months of routine care within the Mount Sinai Well being System, whereas excluding identifiable affected person knowledge. Utilizing the outline for every code, the researchers prompted fashions from OpenAI, Google, and Meta to output essentially the most correct medical codes. The generated codes had been in contrast with the unique codes and errors had been analyzed for any patterns.

Evaluation of Mannequin Efficiency

The investigators reported that all the studied massive language fashions, together with GPT-4, GPT-3.5, Gemini-pro, and Llama-2-70b, confirmed restricted accuracy (under 50 %) in reproducing the unique medical codes, highlighting a major hole of their usefulness for medical coding. GPT-4 demonstrated the perfect efficiency, with the best precise match charges for ICD-9-CM (45.9 %), ICD-10-CM (33.9 %), and CPT codes (49.8 %).

GPT-4 additionally produced the best proportion of incorrectly generated codes that also conveyed the right which means. For instance, when given the ICD-9-CM description “nodular prostate with out urinary obstruction,” GPT-4 generated a code for “nodular prostate,” showcasing its comparatively nuanced understanding of medical terminology. Nevertheless, even contemplating these technically appropriate codes, an unacceptably massive variety of errors remained.

The subsequent best-performing mannequin, GPT-3.5, had the best tendency towards being imprecise. It had the best proportion of incorrectly generated codes that had been correct however extra normal in nature in comparison with the exact codes. On this case, when supplied with the ICD-9-CM description “unspecified adversarial impact of anesthesia,” GPT-3.5 generated a code for “different specified adversarial results, not elsewhere categorized.”

Significance of Rigorous AI Analysis

“Our findings underscore the crucial want for rigorous analysis and refinement earlier than deploying AI applied sciences in delicate operational areas like medical coding,” says research corresponding writer Ali Soroush, MD, MS, Assistant Professor of Knowledge-Pushed and Digital Medication (D3M), and Medication (Gastroenterology), at Icahn Mount Sinai. “Whereas AI holds nice potential, it should be approached with warning and ongoing growth to make sure its reliability and efficacy in well being care.”

One potential utility for these fashions within the healthcare trade, say the investigators, is automating the project of medical codes for reimbursement and analysis functions based mostly on scientific textual content.

“Earlier research point out that newer massive language fashions battle with numerical duties. Nevertheless, the extent of their accuracy in assigning medical codes from scientific textual content had not been completely investigated throughout totally different fashions,” says co-senior writer Eyal Klang, MD, Director of the D3M’s Generative AI Analysis Program. “Subsequently, our intention was to evaluate whether or not these fashions might successfully carry out the basic process of matching a medical code to its corresponding official textual content description.”

The research authors proposed that integrating LLMs with professional information might automate medical code extraction, doubtlessly enhancing billing accuracy and decreasing administrative prices in well being care.

Conclusion and Subsequent Steps

“This research sheds gentle on the present capabilities and challenges of AI in well being care, emphasizing the necessity for cautious consideration and extra refinement previous to widespread adoption,” says co-senior writer Girish Nadkarni, MD, MPH, Irene and Dr. Arthur M. Fishberg Professor of Medication at Icahn Mount Sinai, Director of The Charles Bronfman Institute of Customized Medication, and System Chief of D3M.

The researchers warning that the research’s synthetic process could not absolutely characterize real-world situations the place LLM efficiency could possibly be worse.

Subsequent, the analysis workforce plans to develop tailor-made LLM instruments for correct medical knowledge extraction and billing code project, aiming to enhance high quality and effectivity in healthcare operations.

Reference: “Massive Language Fashions Are Poor Medical Coders — Benchmarking of Medical Code Querying” by Ali Soroush, Benjamin S. Glicksberg, Eyal Zimlichman, Yiftach Barash, Robert Freeman, Alexander W. Charney, Girish N Nadkarni and Eyal Klang, 19 April 2024, NEJM AI.
DOI: 10.1056/AIdbp2300040

This analysis was supported by the AGA Analysis Basis’s 2023 AGA-Amgen Fellowship to-School Transition Award AGA2023-32-06 and an NIH UL1TR004419 award.





Source link

Tags: CodingLanguagelargemedicalModelsShowsstruggleStudy
Share30Tweet19
ohog5

ohog5

Recommended For You

A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

by ohog5
March 8, 2026
0
A Machine Learning Engineer Thought He Was Safe From AI Layoffs. Then He Got Some Depressing News

Signal as much as see the long run, right now Can’t-miss improvements from the bleeding fringe of science and tech Whereas the precise influence of AI on the...

Read more

How can you get rid of a phobia?

by ohog5
March 8, 2026
0
How can you get rid of a phobia?

An skilled has solutions for you about what phobias are and how one can eliminate them. Within the Alfred Hitchcock basic movie Vertigo, the protagonist John “Scottie” Ferguson,...

Read more

CBP Used Online Ad Data to Track Phone Locations

by ohog5
March 7, 2026
0
CBP Used Online Ad Data to Track Phone Locations

America and Israel launched a war in Iran final week that has already killed greater than 1,200 Iranians and spilled out across the Middle East. There are many...

Read more

How “Empty Space” Is Supercharging Atomically Thin Semiconductors

by ohog5
March 6, 2026
0
How “Empty Space” Is Supercharging Atomically Thin Semiconductors

A single layer of atoms could seem too skinny to meaningfully work together with gentle, but supplies like tungsten disulfide are reshaping what is feasible in nanophotonics. Researchers...

Read more

Thousands of Everyday Drone Pilots Are Making a Google Street View From Above

by ohog5
March 6, 2026
0
Thousands of Everyday Drone Pilots Are Making a Google Street View From Above

Gaspard-Félix Tournachon, popularly referred to as “Nadar,” took the first known aerial photographs utilizing a digicam connected to a hot-air balloon simply outdoors Paris in 1858. Ever since,...

Read more
Next Post
These Incredibly Popular Drugs Have Been Linked to Migraines

These Incredibly Popular Drugs Have Been Linked to Migraines

Leave a Reply

Your email address will not be published. Required fields are marked *

Related News

AI Is Being Used to ‘Turbocharge’ Scams

AI Is Being Used to ‘Turbocharge’ Scams

June 5, 2023
World News in Brief: Rights chief ‘horrified’ at deadly PNG violence, Lebanon-Israel ‘knife edge’, Sudan refugees suffer sexual violence | Department of Political and Peacebuilding Affairs – Department of Political and Peacebuilding Affairs

Live Briefing: Gaza cease-fire talks set to resume; Iran braces for Israeli retaliation – The Washington Post

October 25, 2024
UK’s Metro Bank attracts takeover interest from private equity

UK’s Metro Bank attracts takeover interest from private equity

June 15, 2025

Browse by Category

  • Business
  • Health
  • Politics
  • Tech
  • World

Recent News

These Tiny Gut Particles Could Be Accelerating Aging Throughout the Body

These Tiny Gut Particles Could Be Accelerating Aging Throughout the Body

June 9, 2026
This Simple Drink Could Help Calm the Inflammation Behind Many Diseases

This Simple Drink Could Help Calm the Inflammation Behind Many Diseases

June 7, 2026

CATEGORIES

  • Business
  • Health
  • Politics
  • Tech
  • World

Follow Us

Recommended

  • These Tiny Gut Particles Could Be Accelerating Aging Throughout the Body
  • This Simple Drink Could Help Calm the Inflammation Behind Many Diseases
  • Leveraging Real-World Data for Proactive Protocol Design
  • The Mineral Matrix and How it Changes Everything
No Result
View All Result
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop

© 2023 ThisBigInfluence

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?