GPT-4 vs GPT-4o? Which is better?

milesdr · May 17, 2024, 7:01pm

GPT-4 is still much better for our complex tasks that require careful reading and strict prompt following. GPT-4-Turbo was OK for basic tasks or “conversation”, but we use GPT-3.5-turbo for those.

GPT-4o is very bad compared to GPT-4 and even GPT-4-turbo for our uses, but we switched to GPT-4o anyway because of the price, and our scripts filter out the terrible outputs we sometimes receive. Some of the outputs are random strings that have nothing to do with our prompts. Once, 4o randomly gave us the specs of a Boeing plane.
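For anyone wondering what such a filter might look like, here is a minimal sketch. It is not our actual script: the function names, the stop-word list, and the 0.1 threshold are all made up for illustration, and it assumes that an off-topic reply shares almost no content words with the prompt.

```python
import re

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = {"the", "and", "for", "are", "with", "that", "this", "has", "was"}


def content_words(text: str) -> set:
    """Return lowercased alphabetic tokens of 3+ letters, minus stop words."""
    tokens = re.findall(r"[a-z]{3,}", text.lower())
    return {t for t in tokens if t not in STOP_WORDS}


def looks_off_topic(prompt: str, output: str, min_overlap: float = 0.1) -> bool:
    """Flag an output whose content words barely overlap with the prompt.

    min_overlap is an arbitrary illustrative threshold, not a tuned value.
    """
    out_words = content_words(output)
    if not out_words:
        # Empty or non-alphabetic output (e.g. a random symbol string).
        return True
    shared = out_words & content_words(prompt)
    return len(shared) / len(out_words) < min_overlap


prompt = "Summarize the maintenance contract terms for the warehouse lease."
on_topic = "The warehouse lease contract requires quarterly maintenance."
off_topic = "The Boeing aircraft wingspan exceeds thirty metres."

print(looks_off_topic(prompt, on_topic))
print(looks_off_topic(prompt, off_topic))
```

A word-overlap check like this is crude: it catches replies about an unrelated subject but would miss a fluent answer that is on topic and simply wrong.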

Frustrating to see leaps forward in image reading (4o is GREAT at that) but large steps back in complex analysis or tasks.

One of our simplest benchmarks is whether a model can answer a multiple-choice question of the form “All of the following are TRUE, EXCEPT:” on a semi-complex topic.

GPT-4 fails often, but the rest of the models fail every time.
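A benchmark of this kind could be scored roughly as below. This is a sketch under assumptions: `ExceptQuestion`, `format_prompt`, and `failure_rate` are hypothetical names, `ask_model` stands in for whatever function sends a prompt to a model and returns its text reply, and the sample questions are trivial placeholders rather than real benchmark items.

```python
from dataclasses import dataclass
from typing import Callable, List

LETTERS = "ABCDEFGH"


@dataclass
class ExceptQuestion:
    stem: str           # e.g. "All of the following are TRUE, EXCEPT:"
    options: List[str]  # answer choices, in display order
    false_index: int    # index of the one FALSE statement (the right pick)


def format_prompt(q: ExceptQuestion) -> str:
    """Render the question as lettered options plus an answer instruction."""
    lines = [q.stem]
    lines += [f"{LETTERS[i]}. {opt}" for i, opt in enumerate(q.options)]
    lines.append("Answer with a single letter.")
    return "\n".join(lines)


def failure_rate(ask_model: Callable[[str], str],
                 questions: List[ExceptQuestion]) -> float:
    """Fraction of questions where the reply does not start with the right letter."""
    if not questions:
        raise ValueError("need at least one question")
    failures = 0
    for q in questions:
        reply = ask_model(format_prompt(q)).strip().upper()
        if not reply.startswith(LETTERS[q.false_index]):
            failures += 1
    return failures / len(questions)


# Placeholder questions; a real set would cover a semi-complex topic.
questions = [
    ExceptQuestion("All of the following are TRUE, EXCEPT:",
                   ["2 + 2 = 5", "3 * 3 = 9", "10 / 2 = 5"], 0),
    ExceptQuestion("All of the following are TRUE, EXCEPT:",
                   ["2 + 2 = 4", "3 * 3 = 9", "10 / 2 = 4"], 2),
]


def always_a(prompt: str) -> str:
    """Stub model that always answers 'A', standing in for a real API call."""
    return "A"


print(failure_rate(always_a, questions))
```

Matching only the first letter of the reply is a simplification; a model that answers in a full sentence would need a more forgiving parser.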
