GPT-4-Turbo consistently getting a number wrong

crumrinece · April 3, 2024, 2:05am

I’ve come across a scenario where I’m trying to extract key details from candidates interview notes. Using the latest GPT-4-Turbo preview model available on Azure OpenAI & function calling to describe as a JSON schema the notes I want pulled out.

The notes contain a reference to growing revenue from $25 - $50 million over the time period.

The output of the function call response consistently contains a reference to this, but says $24 - $50 million instead.

It does this every time.

I’ve even added a “citation” property to the JSON schema and updated the prompt with instructions to include a snippet of text extracted exactly from the notes that supports it’s notes, and in that property it extracts the correct sentence but with the $25 changed to $24 in that reference as well.

I’ve rarely seen a “mistake” like this in extraction, and never one that is wrong in the exact same way so consistently.

Has anyone encountered a similar case? If so, what (if anything) helped?

Diet · April 3, 2024, 2:29am

Welcome to the community!

That does look pretty interesting. Do you think you can share a prompt where this happens?

Which exact model are you using?

_j · April 3, 2024, 7:05am

The numbers 0 (or 00 or 000) to 999 are their own unique tokens.

You can retrieve the logprob values for different inputs and see why “repeat back 25” has a high chance of producing 24 in this particular case.

Then a negative logit_bias could affect the generation and unbias this bad generation… or not if using the “tool call” API and its enabling unwanted “JSON mode” with its anti-developer ignoring of logit_bias.

Topic		Replies	Views
GPT3.5 Turbo downgraded suddenly? API	6	1697	November 14, 2023
Gpt-4o-mini (even gpt-3.5-turbo) works but gpt-4o doesn't Bugs gpt-4o	7	433	November 24, 2024
Open AI APIs responses becoming random Community gpt-4 , api	3	963	April 28, 2024
GPT-4 becoming dumber sometimes, for a while API	7	2934	December 18, 2023
New gpt-4-turbo-preview saying it can't help on complex prompt Prompting gpt-4 , api , gpt-4-turbo	7	2682	January 29, 2024

GPT-4-Turbo consistently getting a number wrong

Related topics