Sunday, January 25, 2026
This Big Influence
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop
No Result
View All Result
This Big Influence
No Result
View All Result
Home Tech

GPT-5 Is Making Huge Factual Errors, Users Say

ohog5 by ohog5
September 9, 2025
in Tech
0
GPT-5 Is Making Huge Factual Errors, Users Say
74
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter


It has been simply over a month since OpenAI dropped its long-awaited GPT-5 giant language mannequin (LLM) — and it hasn’t stopped spewing an astonishing quantity of unusual falsehoods since then.

From the AI experts on the Discovery Institute’s Walter Bradley Middle for Synthetic Intelligence and irked Redditors on r/ChatGPTPro, to even OpenAI CEO Sam Altman himself, there’s loads of proof to recommend that OpenAI’s declare that GPT-5 boasts “PhD-level intelligence” comes with some severe asterisks.

In a Reddit publish, a user realized not solely that GPT-5 had been producing “incorrect data on fundamental info over half the time,” however that with out fact-checking, they could have missed different hallucinations.

The Reddit consumer’s expertise highlights simply how widespread it’s for chatbots to hallucinate, which is AI-speak for confidently making stuff up. Whereas the problem is far from exclusive to ChatGPT, OpenAI’s newest LLM appears to have a particular penchant for BS — a actuality that challenges the corporate’s declare that GPT-5 hallucinates less than its predecessors.

In a recent blog post about hallucinations, wherein OpenAI as soon as once more claimed that GPT-5 produces “considerably fewer” of them — the agency tried to clarify how and why these falsehoods happen.

“Hallucinations persist partly as a result of present analysis strategies set the incorrect incentives,” the September 5 publish reads. “Whereas evaluations themselves don’t instantly trigger hallucinations, most evaluations measure mannequin efficiency in a manner that encourages guessing moderately than honesty about uncertainty.”

Translation: LLMs hallucinate as a result of they’re educated to get issues proper, even when it means guessing. Although some fashions, like Anthropic’s Claude, have been educated to confess when they do not know a solution, OpenAI’s haven’t — thus, they wager incorrect guesses.

Because the Reddit consumer indicated (backed up with a link to their conversation log), they received some huge factual errors when asking in regards to the gross home product (GDP) of assorted international locations and had been offered by the chatbot with “figures that had been actually double the precise values.”

Poland, for example, was listed as having a GDP of greater than two trillion {dollars}, when in actuality its GDP, per the International Monetary Fund, is presently hovering round $979 billion. Have been we to wager a guess, we would say that that hallucination could also be attributed to recent boasts from the country’s president saying its economic system (and never its GDP) has exceeded $1 trillion.

“The scary half? I solely seen these errors as a result of some solutions appeared so off that they made me suspicious,” the consumer continued. “For example, once I noticed GDP numbers that appeared manner too excessive, I double-checked and located they had been fully incorrect.”

“This makes me surprise: What number of instances do I NOT fact-check and simply settle for the incorrect data as reality?” they mused.

In the meantime, AI skeptic Gary Smith of the Walter Bradley Middle famous that he is finished three easy experiments with GPT-5 since its launch — a modified game of tic-tac-toe, questioning about financial advice, and a request to draw a possum with 5 of its physique components labeled — to “display that GPT 5.0 was removed from PhD-level experience.”

The possum example was notably egregious, technically arising with the best names for the animal’s components however pinning them in unusual locations, resembling marking its leg as its nostril and its tail as its again left foot. When making an attempt to duplicate the experiment for a more recent post, Smith found that even when he made a typo — “posse” as an alternative of “possum” — GPT-5 mislabeled the components in a equally weird style.

As an alternative of the meant possum, the LLM generated a picture of its obvious concept of a posse: 5 cowboys, some toting weapons, with traces indicating varied components. A few of these components — the pinnacle, foot, and probably the ear — had been correct, whereas the shoulder pointed to one of many cowboys’ ten-gallon hats and the “fand,” which can be a mix-up of foot and hand, pointed at considered one of their shins.

We determined to do the same check, asking GPT-5 to offer a picture of “a posse with six body parts labeled.” After clarifying that Futurism needed a labeled picture and never a textual content description, ChatGPT went off to work — and what it spat out was, as you may see beneath, much more hilariously incorrect than what Smith received.

It appears fairly clear from this aspect of the GPT-5 launch that it is nowhere close to as good as a doctoral candidate — or, at very least, one which has any probability of really attaining their PhD.

You might also like

OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents

2 moral actions shape first impressions more than others

DOGE May Have Misused Social Security Data, DOJ Admits

The ethical of this story, it appears, is to fact-check something a chatbot spits out — or forgo utilizing AI and do the analysis for your self.

Extra on GPT-5: After Disastrous GPT-5, Sam Altman Pivots to Hyping Up GPT-6



Source link

Tags: ErrorsFactualGPT5HugeMakingusers
Share30Tweet19
ohog5

ohog5

Recommended For You

OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents

by ohog5
January 25, 2026
0
OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents

Illustration by Tag Hartman-Simkins / Futurism. Supply: Getty Photographs One thing unusual is occurring with ManyVids, an OnlyFans-like porn platform with tens of millions of customers. For roughly...

Read more

2 moral actions shape first impressions more than others

by ohog5
January 25, 2026
0
2 moral actions shape first impressions more than others

Share this Article You're free to share this text underneath the Attribution 4.0 Worldwide license. New analysis reveals that equity and respect for property form our first impressions—and...

Read more

DOGE May Have Misused Social Security Data, DOJ Admits

by ohog5
January 24, 2026
0
DOGE May Have Misused Social Security Data, DOJ Admits

Legislation enforcement authorities in the US have for years circumvented the US Constitution’s Fourth Amendment by purchasing data on US residents that might in any other case must...

Read more

Amazon Echo Studio deal: Save $30 with coupon code

by ohog5
January 24, 2026
0
Amazon Echo Studio deal: Save $30 with coupon code

SAVE $30: As of Jan. 23, the Amazon Echo Studio is on sale for $189.99 with the on-page coupon code ECHOSTUDIO30. That is a financial savings of about...

Read more

Twisting a Crystal at the Nanoscale Changes How Electricity Flows

by ohog5
January 23, 2026
0
Twisting a Crystal at the Nanoscale Changes How Electricity Flows

Scientists have proven that twisting a crystal on the nanoscale can flip it right into a tiny, reversible diode, hinting at a brand new period of shape-engineered electronics....

Read more
Next Post
Why One Brain Circuit Collapses First in Alzheimer’s

Why One Brain Circuit Collapses First in Alzheimer’s

Related News

Scientists Identify New Health Benefits of Potatoes

Scientists Identify New Health Benefits of Potatoes

August 15, 2024
World News in Brief: Rights chief ‘horrified’ at deadly PNG violence, Lebanon-Israel ‘knife edge’, Sudan refugees suffer sexual violence | Department of Political and Peacebuilding Affairs – Department of Political and Peacebuilding Affairs

World News Live Today December 23, 2024: Syria's new leader says militias will join Army, state to control ‘all weapons’ – Hindustan Times

December 22, 2024
World News in Brief: Rights chief ‘horrified’ at deadly PNG violence, Lebanon-Israel ‘knife edge’, Sudan refugees suffer sexual violence | Department of Political and Peacebuilding Affairs – Department of Political and Peacebuilding Affairs

Not reconsidering Matt Gaetz nomination for attorney general, says Trump – Business Standard

November 20, 2024

Browse by Category

  • Business
  • Health
  • Politics
  • Tech
  • World

Recent News

OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents

OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents

January 25, 2026
Cartoon: Sanctuary Seahawks

Cartoon: Sanctuary Seahawks

January 25, 2026

CATEGORIES

  • Business
  • Health
  • Politics
  • Tech
  • World

Follow Us

Recommended

  • OnlyFans Rival Seemingly Succumbs to AI Psychosis, Which We Dare You to Try Explain to Your Parents
  • Cartoon: Sanctuary Seahawks
  • 2 moral actions shape first impressions more than others
  • Spice Bazaar celebrates its one year anniversary at store in Salisbury – delmarvanow.com
No Result
View All Result
  • Home
  • World
  • Podcast
  • Politics
  • Business
  • Health
  • Tech
  • Awards
  • Shop

© 2023 ThisBigInfluence

Cleantalk Pixel
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?