Last week, I ran gpt-4-turbo on my task (>1000 inferences) and it worked great. Today, I am trying the exact same prompts and it’s performing much worse. For example, it’s now consistently failing to write output in the expected format, even though I include 5 exemplars of that format in the prompt.
Has something changed with the gpt-4-turbo API recently? FWIW, I am using gpt-4-turbo instead of gpt-4o because gpt-4-turbo worked better for my task when I compared the two last week.
We are experiencing the same situation. We migrated from GPT-4-turbo to GPT-4o and then back after a few days. We’re using the Chat Completions API to generate JSON, and the number of JSON formatting errors has increased dramatically (not to mention issues with the content itself). The change is evident in our logs; something has definitely worsened.
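In case it helps anyone hitting the same formatting errors: a minimal sketch of what we mean by generating JSON through the Chat Completions API, assuming the openai Python SDK v1.x and a model that supports JSON mode (`response_format={"type": "json_object"}`). The keys and prompt text are hypothetical placeholders, not our actual prompt.

```python
import json
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

resp = client.chat.completions.create(
    model="gpt-4-turbo",
    # JSON mode: the model is constrained to emit a syntactically valid JSON object.
    # Note: the word "JSON" must appear somewhere in the messages for this to be accepted.
    response_format={"type": "json_object"},
    messages=[
        {"role": "system", "content": "Reply with a JSON object with keys 'title' and 'tags'."},
        {"role": "user", "content": "Summarize this ticket: printer is on fire."},
    ],
)

try:
    data = json.loads(resp.choices[0].message.content)
except json.JSONDecodeError:
    # Even with JSON mode we log and retry rather than trusting the output blindly.
    data = None
```

Even with this, we still validate the parsed object against our expected schema before using it, since JSON mode only guarantees syntax, not the keys or content.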
I have found that, of all output formats, a markdown table is the most consistent across the new models. So if possible, I would suggest adapting your code to consume markdown tables; a rough parsing sketch is below.
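A rough sketch of what that adaptation could look like, assuming the model returns a pipe-delimited markdown table; the column names in the usage example are hypothetical.

```python
def parse_markdown_table(text: str) -> list[dict[str, str]]:
    """Parse the first pipe-delimited markdown table in `text` into a list of row dicts."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip().startswith("|")]
    if len(lines) < 2:
        return []

    def split_row(line: str) -> list[str]:
        return [cell.strip() for cell in line.strip("|").split("|")]

    header = split_row(lines[0])
    rows = []
    for line in lines[2:]:  # lines[1] is the |---|---| separator row
        cells = split_row(line)
        if len(cells) == len(header):
            rows.append(dict(zip(header, cells)))
    return rows


reply = "| name | score |\n|---|---|\n| foo | 0.9 |"
print(parse_markdown_table(reply))  # [{'name': 'foo', 'score': '0.9'}]
```

The appeal is that even when the model drifts a bit (extra whitespace, bold headers), the row/column structure tends to survive, whereas JSON fails hard on a single missing brace.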
It’s a simple instruction: just wrap the response in ‘finalAnswer’ tags. But now I’m getting inconsistent tags like ‘finalFoo’, ‘finalBar’, etc. This wasn’t a problem last week when I ran 1000s of inferences.
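As a stopgap for the drifting tag names, here is a minimal extraction sketch I’ve been using, assuming XML-style tags like `<finalAnswer>...</finalAnswer>` (adjust the patterns if your delimiter differs). The lenient fallback that accepts any `final*` tag is my own workaround, not anything the API guarantees.

```python
import re


def extract_final_answer(reply: str) -> str | None:
    """Pull the content out of <finalAnswer>...</finalAnswer> tags.

    Falls back to any tag starting with 'final' (e.g. <finalFoo>) so that
    drifted tag names are still recoverable instead of failing the whole run.
    """
    strict = re.search(r"<finalAnswer>(.*?)</finalAnswer>", reply, re.DOTALL)
    if strict:
        return strict.group(1).strip()
    lenient = re.search(r"<(final\w*)>(.*?)</\1>", reply, re.DOTALL | re.IGNORECASE)
    return lenient.group(2).strip() if lenient else None
```

It obviously doesn’t fix the underlying regression, but it keeps the pipeline from silently dropping answers while we figure out what changed.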