This week has been filled with disappointment regarding GPT-5. I noticed a major change in the model’s behavior, especially in tasks that used to be simple to solve.
I do not have technical evidence or concrete proof to state this with certainty, but my user experience has changed a lot. The GPT-5 I am using now does not feel like the same model from launch.
I tried to complete the same task dozens of times. I adjusted the prompts several times, made the instructions clearer, rewrote the requests, explained the goal in different ways, and even then the model kept delivering almost the same result. In some cases, I asked it to redo the task more than five times, always reinforcing what needed to be corrected, but it ignored important parts of the instructions and repeated the same mistakes.
The behavior became very similar to what I have seen in other models: instead of following the request precisely, it tries to “work around” the problem, creates messy fixes, invents improvised solutions, generates unnecessary scripts, and when I ask for adjustments, the result often becomes even worse. The model feels less careful, less consistent, and less obedient to instructions.
I reached the point where I had to create a skill with detailed instructions, along with much more structured prompts, just to try to get a minimally acceptable result. Even then, I could not reach the same level of quality that I used to get before with a simple prompt and, at most, one or two adjustments.
My impression is that the current model may not be exactly the same as the launch version. It feels like a smaller, more limited variant, or like some kind of optimization has been applied that reduced the quality of the responses. I know this is only a personal assumption, but the difference in behavior is too significant to ignore.
One clear example is GPT-5’s reasoning mode. Before, it seemed to think for longer, analyze the problem better, look for sources when necessary, and deliver more consistent answers. Now, many times, the reasoning seems to last only two or three seconds before it starts answering. This behavior reminds me a lot of smaller models or models optimized to answer quickly, but with less depth.
The issue is not just one bad response. The issue is the repeated pattern: ignored instructions, poor self-correction, improvised solutions, loss of context, lack of depth, and difficulty delivering something that used to be relatively simple.
I would like to know whether other people have also noticed this change in GPT-5 over the last few weeks. For me, the current experience is far below the quality the model showed at launch.

