gpt-4o-mini wins on speed, with both the lowest average latency and the highest token rate.
| Model | Trials | Avg Latency (s) | Avg Rate (tokens/s) |
|---|---|---|---|
| gpt-4o-2024-08-06 | 4 | 0.739 | 41.698 |
| gpt-4o-2024-05-13 | 4 | 0.730 | 64.069 |
| gpt-4o-2024-11-20 | 4 | 0.676 | 37.113 |
| gpt-4o-mini | 4 | 0.558 | 111.561 |
| gpt-3.5-turbo | 4 | 0.571 | 63.459 |
(These figures are from me running all 20 API call trials in parallel, with a small messages input.)
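A benchmark like this can be reproduced with a short asyncio harness. The sketch below is an assumption, not the exact script behind the table: the prompt is made up, and it treats latency as total call time and rate as completion tokens divided by that time (the original may have measured these differently, e.g. time to first token with streaming).

```python
# Minimal sketch of a parallel latency benchmark; model list, trial count,
# and prompt are assumptions, not the exact harness behind the table above.
import asyncio
import time

from openai import AsyncOpenAI  # pip install openai

MODELS = [
    "gpt-4o-2024-08-06",
    "gpt-4o-2024-05-13",
    "gpt-4o-2024-11-20",
    "gpt-4o-mini",
    "gpt-3.5-turbo",
]
TRIALS = 4
# A deliberately small messages input.
MESSAGES = [{"role": "user", "content": "Write one sentence about latency."}]

client = AsyncOpenAI()  # reads OPENAI_API_KEY from the environment


async def one_trial(model: str) -> tuple[str, float, float]:
    """Run one chat completion; return (model, latency_s, tokens_per_s)."""
    start = time.perf_counter()
    resp = await client.chat.completions.create(model=model, messages=MESSAGES)
    latency = time.perf_counter() - start
    # Rate here = completion tokens / total call time (an assumption).
    return model, latency, resp.usage.completion_tokens / latency


async def main() -> None:
    # Fire all 20 calls (5 models x 4 trials) at once, then average per model.
    tasks = [one_trial(m) for m in MODELS for _ in range(TRIALS)]
    results = await asyncio.gather(*tasks)
    for model in MODELS:
        rows = [(lat, rate) for m, lat, rate in results if m == model]
        avg_lat = sum(lat for lat, _ in rows) / len(rows)
        avg_rate = sum(rate for _, rate in rows) / len(rows)
        print(f"| {model} | {len(rows)} | {avg_lat:.3f} | {avg_rate:.3f} |")


if __name__ == "__main__":
    asyncio.run(main())
```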
gpt-4o-mini has a noticeably different response quality and depth of understanding, especially over a longer chat. It also accepts much larger message inputs. It may handle predictable chat well, but it does not adapt as well to the original tasks an API developer might "program" into it. You will need to evaluate the quality of each model for yourself.