Previously, if you wrote a SYSTEM prompt and deleted the default USER input, you would still get a reasonable output; now, doing the same produces garbage.
It does feel like it has been underperforming older ChatGPT versions in some areas. I do think they work hard on improving the overall user experience, but it feels bad that it's somewhat of a give-and-take between users' different use cases.
I think they might have distilled GPT-3.5's 175B params into a smaller model.
Warning: this is just a thought! I'm guessing, and it has a very high probability of being wrong.
You should give AlphaWave a try… https://github.com/Stevenic/alphawave-py
I guarantee you'll get valid JSON back every time, even with GPT-3.5-turbo.
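For anyone wondering how a library like this can "guarantee" valid JSON: the core trick is a validate-and-retry loop that feeds the parse error back to the model. This is a simplified sketch of the idea, not AlphaWave's actual API; `call_model` is a hypothetical stand-in for whatever LLM client you use:

```python
import json

def get_valid_json(call_model, prompt, max_retries=3):
    """Ask the model for JSON; on a parse failure, feed the error back and retry."""
    messages = [{"role": "user", "content": prompt}]
    for _ in range(max_retries):
        reply = call_model(messages)
        try:
            return json.loads(reply)
        except json.JSONDecodeError as err:
            # Keep the bad reply in context and tell the model exactly
            # what was wrong, so the next attempt can repair it.
            messages.append({"role": "assistant", "content": reply})
            messages.append({
                "role": "user",
                "content": f"That was not valid JSON ({err}). Reply with corrected JSON only.",
            })
    raise ValueError("Model never produced valid JSON")
```

Because the model sees its own failed output plus a concrete error message, the repair attempt usually succeeds within a retry or two, even on weaker models.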
The prompt could be improved, though. It is a natural-language system with the reasoning ability of a 5-year-old. Ideally, talk to it like you would to a child.
Even as an adult who does programming, it's not entirely clear what you're expecting. You have to give it some examples.
Also, the system instructions are more like "soft" instructions with GPT-3.5, and they seem to lose fine details like commands and formatting.
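The "give it some examples" advice above is few-shot prompting: put one or two demonstration input/output pairs directly in the conversation before the real query. A minimal sketch using the OpenAI chat-message convention (the extraction task and examples here are made up for illustration):

```python
# Few-shot prompt: demonstrate the exact output format you expect
# before giving the model the real input.
messages = [
    {"role": "system", "content": "Extract the city from the sentence. Reply with the city name only."},
    # Example 1: show the expected input/output shape
    {"role": "user", "content": "I flew into Tokyo last night."},
    {"role": "assistant", "content": "Tokyo"},
    # Example 2: reinforce the format
    {"role": "user", "content": "The conference is being held in Berlin."},
    {"role": "assistant", "content": "Berlin"},
    # The real query comes last
    {"role": "user", "content": "We drove through Lyon on the way south."},
]
```

Since GPT-3.5 treats system instructions as "soft", the concrete examples often do more to pin down formatting than any amount of wording in the system message.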
It's likely a wrong conclusion to think they would spend another million dollars in compute just to train a model with fewer tokens or parameters, when the lack of computing resources is the biggest hurdle to moving forward.
There are lots of other optimizations (distillation, for example) that can be applied to an AI model to cut generation costs at the expense of quality.
I can confirm it is not just 3.5 but also 4 and 4-0316 in the playground for me. Very noticeable drop in memory, logic, and reasoning.
In the "old version" (if there is one), the model would simulate user inputs and outputs.
It used to work as a zero-shot model, though.
Actually, I haven't gotten GPT-4 API access yet, so I have no clue how GPT-4 is doing.
But I'd rather spend more on better models than have them decrease the price and make the model worse.
You're right, but they could just release more versions and make the pricing more granular.
Big updates to 3.5-turbo today: a new version of the model is releasing.
Welcome to the forum!
I know about this update; I thought they made the old version more stupid and the new version better than the old one.
The 16k model seemed to be better.
The new version is significantly less intelligent. It's depressing, because I got online thinking the new engine would be better. This is what we're stuck with.
Sad. Maybe they know there are competitors now, so they started making it cheaper.
I completely agree! I'm deeply involved in cold email outreach, and I've found that the new model seems almost programmed to start every email with "I hope this email finds you well". Despite spending hours trying to find a workaround, I've had no luck in changing this default greeting. Interestingly, this wasn't an issue with the previous version.
They changed stuff without changing the model name.
Have you tried adding enforcement to the input itself for the most important parts, versus depending on the internal guidelines/rules to work? I have found that this works great. If you haven't and want to try, I can give you more tips; just let me know.
It's very frustrating how it refuses to follow some instructions now. My theory is that the bulk of the computational power for an LLM goes into comprehending the request, similar to a human.
If we as humans understand the instructions, giving a response (even if it's wrong) is relatively easy. If we are given detailed or advanced instructions, understanding them requires a lot of brain power, and we will only respond to, or act on, the parts we understand.
That's essentially what the engine is doing now. It's not using computational resources to comprehend the request; it's simply blasting out a response to whichever part is easiest for it to understand.
I imagine Microsoft and OpenAI are going to save hundreds of millions, if not billions, with this change, but it's terrible for the end user. They should at minimum give back 10% of its intelligence.
If that works well for you, maybe ClosedAI is trying to force users to consume more tokens so they make more money. Evil move.
That's how they would make more money, hehe.