Over that last couple of days 5.5 xhigh has turned from a highly capable agent, to a toddler which can barely follow multistep instructions and implementation. Never ever have I experienced such utter frustration when trying to work with this model. The difference is extremely noticeable.
As I type this out, I have it running on the exact same task for the 10th time. The task itself comprises of 3 different things, which are not even moderate difficulty. Each time it has forgot 1 or even 2 of the 3 things. Other times it has been extremely lazy and just decided not to do one of the 3 things. This is not 5.5, it never did this. This behavior has been consistent for the last few days.
Openai should be sued for doing things like this. I don’t pay 200 a month for a functioning model to be degraded.
Reasons why this is happening:
- nerfing the capabilities of 5.5 to make 5.6 seem better than it actually is.
- cost reductions hoping that no one notices
