I am trying to use the Assistants API with retrieval and it is quite slow. It takes anywhere from 3 to 15 seconds to respond, sometimes even more.
This question is for the OpenAI team: which model will give me faster results on average, gpt-3.5-turbo-1106 or gpt-4-1106-preview?
Also, I am on Usage tier 3. Will upgrading to Usage tier 4 reduce latency?
GPT-3.5 will always be faster since it's a "lighter" model. GPT-4 is pretty resource-intensive on the servers, hence why it's slower.
Upgrading tiers will likely not change the outcome.
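If you want to compare for yourself rather than take averages on faith, here is a minimal timing sketch. It assumes the OpenAI Python SDK (openai>=1.x) and an `OPENAI_API_KEY` in the environment; the `time_call` helper is something I wrote for illustration, not part of the SDK, and I'm using a plain chat completion as a proxy since timing a full Assistants run involves polling:

```python
import time

def time_call(fn, *args, **kwargs):
    """Call fn once and return (result, elapsed_seconds) of wall-clock time."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start

# Sketch of comparing the two models from the thread (requires OPENAI_API_KEY):
# from openai import OpenAI
# client = OpenAI()
# for model in ("gpt-3.5-turbo-1106", "gpt-4-1106-preview"):
#     _, elapsed = time_call(
#         client.chat.completions.create,
#         model=model,
#         messages=[{"role": "user", "content": "Say hi"}],
#     )
#     print(f"{model}: {elapsed:.2f}s")
```

Run a handful of requests per model and average, since single calls vary a lot; that will tell you more than any general claim about which model is faster for your workload.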
Yeah, it performs better (more reliably) when you have a lot of input tokens. GPT-3.5 is currently failing on almost every request for me. I guess the model is too crowded.