OpenAI just published a new paper on hallucinations in Language Models.
tldr: “Our new research paper argues that language models hallucinate because standard training and evaluation procedures reward guessing over acknowledging uncertainty.”
Wanted to start a new topic to discuss this paper, and also some of the ways the community is working to reduce hallucinations in outputs, whether using the API or ChatGPT.
I’ll start by saying that one common approach I’ve seen discussed is to give the model an ‘out’, allowing it to skip answering a question it doesn’t know the answer to by specifying in the prompt: “If you don’t know the answer, say so” (or some variation of this). This has yielded mixed results; a minimal sketch of the idea is below. What other ways have you been using to detect and reduce occurrences of hallucinations?
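Here’s a minimal sketch of the “give it an out” approach using the OpenAI Python SDK. The model name, the exact abstention wording, and the test question are all placeholders I picked for illustration, not anything prescribed by the paper:

```python
# Minimal sketch: give the model an explicit "out" in the system prompt.
# Model name and exact wording are assumptions; adjust for your use case.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

SYSTEM_PROMPT = (
    "Answer only if you are confident the answer is correct. "
    "If you are unsure or lack the information, reply exactly with: I don't know."
)

def ask(question: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumption: swap in whichever model you actually use
        messages=[
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": question},
        ],
        temperature=0,  # lower temperature tends to reduce speculative answers
    )
    return response.choices[0].message.content

if __name__ == "__main__":
    # Trick question: the first Nobel Prizes were awarded in 1901, so a
    # well-calibrated response here is the abstention string.
    print(ask("Who won the 1897 Nobel Prize in Physics?"))
```

One design note: pinning the abstention to an exact string (“I don't know”) makes it easy to detect refusals downstream and route those questions to retrieval or a human, though in my experience the model doesn’t always stick to the exact phrasing.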
Reference Links:
Webpage: https://openai.com/index/why-language-models-hallucinate/
Paper: https://cdn.openai.com/pdf/d04913be-3f6f-4d2b-b283-ff432ef4aaa5/why-language-models-hallucinate.pdf