Has anyone noticed a GPT-4o quality drop over the last few days?

I’ve noticed a significant drop in context awareness when generating Python code. For example, when I ask it to modify a script based on my guidelines and then ask it to add some functionality, it forgets its own modifications and alters the first version of the code. What is even worse, it doesn’t follow my very simple, basic instructions and instead goes wildly off track. And this happens in a discussion that is only 6,696 tokens long, with the code being only 25–35 lines. It’s worse than GPT-3.5. I tried multiple chats on the same topic and it keeps getting worse. Has anyone experienced the same issues over the last few days?


It’s possible that the quality drop is linked to peak demand.

1 Like

How do you figure? I’d imagine it either works, works slowly (memory bandwidth limits reached), or doesn’t work at all (OOM) :thinking:

Unless they actually deploy smaller models during peak demand without telling anyone. That would be funny as hell.


My experience is that all kinds of weird problems can happen when a service is overloaded.


100% sure the quality dropped a lot. I guess they are rolling out some update.

1 Like

I am experiencing better performance combining GitHub Copilot and ChatGPT.

The file context awareness helps my workflow a lot and I can diff changes.

Yeah, I am feeling the same here. My assistants have gotten dumber and dumber.


I’m feeling it too. I was in a conversation about a script. Something wasn’t working, so I fixed a mistake (context related, so not GPT’s fault), fed the new script into the conversation, and asked for some additions. Much to my surprise, the GPT made the changes to the script it wrote before and ignored the new script I gave it in the same conversation.
I also ended up in a loop with an error I was having. The first suggestion from GPT didn’t work, so I asked for another solution. That also didn’t work, and when I let GPT know, its response was the first solution it gave. When I pointed that out, it apologised and then gave me the second solution again. Just for testing purposes, I kept asking for working solutions, and the GPT kept alternating between solutions 1 and 2.
I never had that (at least not to that extent) with GPT-4. GPT-4 would at least recommend contacting an expert when it was out of ideas.


Yes, I wish OpenAI (maybe it is time for a rebrand) would be more open about what they’re doing. They released GPT-4o, it was great, then there was a huge outage and it got noticeably worse.


I have worked with prompts every single day for several months, and I think I understand why you feel like the quality has dropped. In fact, instability and hallucinations have always been a big problem for GPT-4 and GPT-4o on complex tasks. Every time I modified a prompt, I would usually run it 50 times to test whether it was stable. It is always unstable unless you are very, very careful with what you say. I guess the more you work with GPT, the more mistakes you will find that it makes. Maybe it is a sign that you are more familiar with GPT now.
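For what it’s worth, the “run it 50 times” check can be automated with a tiny harness. This is just a sketch: `call_model` here is a hypothetical stand-in for whatever API wrapper you use, not a real OpenAI call, and the stability metric (share of runs agreeing with the most common answer) is one arbitrary choice among many.

```python
from collections import Counter

def stability_report(call_model, prompt, runs=50):
    """Run the same prompt `runs` times and summarize output variability.

    `call_model` is a hypothetical callable standing in for your actual
    model-API wrapper; it takes a prompt string and returns a response string.
    """
    outputs = Counter(call_model(prompt) for _ in range(runs))
    modal_output, modal_count = outputs.most_common(1)[0]
    return {
        "distinct_outputs": len(outputs),        # how many different answers appeared
        "modal_share": modal_count / runs,       # fraction agreeing with the most common answer
        "modal_output": modal_output,
    }

# Demo with a deterministic stub in place of a real model call:
report = stability_report(lambda p: "42", "What is 6 * 7?", runs=10)
print(report["distinct_outputs"], report["modal_share"])
```

A prompt you consider stable should show a `modal_share` close to 1.0; a flaky prompt will scatter across many distinct outputs.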


Since day one, I had the feeling they replaced GPT-4 with GPT-2; most of my prompts produced results worse than GPT-3, riddled with mistakes, so I canceled my subscription. After all the hype, it was kind of a big disappointment for me.
I do not know what is going on behind the scenes, but I hope the problems will get solved.


AI Studio is doing a much better job lately… For free…

1 Like

GPT-4o is a disappointment in terms of cognitive capabilities, and always has been, especially when it comes to large context windows. People like it because it’s fast and writes a lot of code, but the quality is frustrating.


Since launch day, I have spent approximately 18 hours working with GPT-4o, expecting it to be an improvement over its predecessor. Unfortunately, I found that the quality of its code tasks, context awareness, and ability to follow instructions declined significantly. It became so unusable for coding that my entire team stopped using ChatGPT altogether. OpenAI seems to have reduced its capabilities. Now I use Claude 3 Opus, and the difference is like night and day.

It is really sad to see them not do anything about it.


Sam Altman has spoken countless times about iterative development. He has also said that OpenAI has solid evidence that GPT-5 is smarter than GPT-4 and that GPT-6 will be smarter than GPT-5.

Considering that GPT-4 is a 2022 technology and OpenAI appears to already be working on version 6, “iterative development” for Altman means:
working toward AGI in the lab while customers iterate on different versions of GPT-4. :melting_face:

1 Like

Altman also said GPT-4 was as dumb as it would ever be. A year of “GPT-4”-branded models that only get dumber proves otherwise.


Since June 17 I have been experiencing problems with ChatGPT and all of its competitors. None of them resolve anything, and they give biased or cut-off responses, and I don’t see this mentioned anywhere. I changed VPNs thinking it was because of the region I was in, but it still doesn’t give answers. I thought it was because I was using the free tier, but it’s the same on the paid one. Is anyone else experiencing this?

It’s funny: a few weeks after each model release, I notice these updates. I am hoping it is only peak demand and not akin to the battery-throttling moves attempted by the likes of Apple in the past to generate more revenue.

I’ve noticed that the replies from the GPT-4 and GPT-4o models under my account all look like they come from 3.5.
When I ask, “What’s your current model and knowledge cutoff date?”, the response is always: “As of my last update, I am based on GPT-4. My knowledge is current up until January 2022!”
Obviously, January 2022 is the knowledge cutoff date for GPT-3.5.

As they say, “never play on release day”; that seems to be true with AI as well.

I like this status page better than the official one:

1 Like