A lot of users have been complaining about the degradation of GPT-4, but most of them haven’t presented concrete prompt/response examples to support their claims. To be fair, I understand we don’t have access to previous versions of GPT-4 to compare against. Incidentally, I suggest OpenAI publish release notes that document the changes in detail, so people won’t have to speculate about them.
Without that information, we cannot compare GPT-4 releases vertically (against its own earlier versions), so I suggest we compare GPT-4 horizontally against other models instead.
Here is my first comparison. I am pleasantly stunned by Mistral’s ability to solve this math problem, which OpenAI presented here, without resorting to any kind of code interpreter. In a way, I think Mistral’s language ability is stronger than GPT-4’s, at least on this particular problem. What do you think?