From experience, I’d say that unfortunately it’s not quite that easy.
Every new token (or word) that ChatGPT spits out comes with its own probability - you can call it certainty if you want. Unfortunately, that probability only refers to that one token, and isn’t indicative of any factual certainty. What we’d all like is a sort of “groundedness” predictor (how grounded the response is in reality) - but current bleeding-edge approaches make the response roughly 5-10x more expensive.
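If you want to see what those per-token probabilities actually look like, here’s a minimal sketch using the Python SDK’s `logprobs` option (the model name and prompt are just placeholders, not anything specific to this thread):

```python
# Minimal sketch (Python, openai >= 1.x SDK): each generated token comes with
# its own log probability. These are next-token probabilities, NOT a measure
# of how factually correct the answer is.
import math
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model name
    messages=[{"role": "user", "content": "Who wrote 'Dune'?"}],
    logprobs=True,        # return a log probability for every generated token
    max_tokens=20,
)

for tok in response.choices[0].logprobs.content:
    print(f"{tok.token!r}: {math.exp(tok.logprob):.2%}")
```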
The percentage match you’re seeing with a lot of search engines works a little differently - there are multiple methods, but for example with LLM-based search (embeddings) you calculate a match angle: 0° is a perfect match, 90° is something completely unrelated. You can map that onto a percentage, but it only applies to raw (document) retrievals, not to LLM outputs/generations.
If you find this interesting and want to play with this stuff I do recommend you check out the APIs on platform.openai.com!
Thanks for sharing your thoughts on this. The problem I am addressing is that, at the moment, ChatGPT sometimes gives incorrect answers in an authoritative manner - in other words, with 100% confidence. That’s not only misleading, it can be dangerous.
I gave the percentage match that search engines use simply as an illustration. To address the problem, “groundedness” is fine. But why not just “% Confidence”, “% Reality”, or “% Accuracy”?
Oh yeah, I agree with you. What it’s ultimately gonna be called won’t matter - but the probabilities we currently get out don’t mean what some people think they mean.
I’m just saying it’s a hard problem that can’t be solved that easily at the moment. But it probably will be at some point!