It’s very, very bad now for coding. Feels like GPT-4 Turbo level, or close.
o3-mini-high was miles ahead of the current o4-mini-high or o3.
These models seem heavily gimped, possibly due to compute constraints. They don't really think anymore - they feel lazy and shallow. It's a hellish experience after o1 and o3-mini.
How does that reduce costs if I now need to enter 10 separate requests to solve a simple task that took only 1-3 prompts with o3-mini-high?
Who’s going to give me back the time wasted debugging the nonsense these newer models output?
And then having to submit 30 prompts just to get the simplest code working? That's absurd.
I genuinely can’t believe that the amazing engineers at OpenAI - the same people who delivered the excellent o3-mini and o1 - are behind these current models.
It’s astonishing and mind-boggling how such a downgrade made it into production.
How did this even pass QA?
If it did pass, then… is this level of performance actually intended?
Are there any updates from OpenAI?
Any fixes coming?