Is there an OpenAI API benchmark (speed ...)?

louis030195 · July 5, 2023, 5:52pm

Hey curious to know if there is a public benchmark of OpenAI api in terms of speed of different models with some variables like time, model, parameters …

Maybe something like https://status.openai.com/ but for speed

Eventually a comparison with other AI APIs?

PS: would be happy to benchmark the API if given free credits of course

novaphil · July 5, 2023, 5:56pm

Not mine, but https://www.gptstat.us

JustinC · July 8, 2023, 2:31pm

@taivo wrote a nice blog post about his experiments with speed. His conclusion was that Azure was almost twice as fast. Which makes sense since they have more hardware.

Looks like he wrote another post too, with some speed tips.
https://www.taivo.ai/__making-gpt-api-responses-faster/

louis030195 · December 12, 2023, 1:07am

Anyone know a comparison of OpenAI’s LLMs token/s vs open source LLMs through vLLM for example?

_j · December 12, 2023, 2:01am

You cannot compare the token generation rate of open source models to OpenAI API models.

The first is hardware-dependent, and the second is secret hardware and payment tier-based.

Topic		Replies	Views
Gpt-4o tokens per second comparable to gpt-3.5-turbo. Data and analysis API gpt-4 , gpt-35-turbo , playground , gpt-4-turbo , gpt-4o	3	11112	August 16, 2024
GPT-3.5 and GPT-4 API response time measurements - FYI API	19	35427	February 6, 2024
Benchmarking response time for GPT4 by context+output tokens API gpt-4 , api-speed	6	6465	November 3, 2023
API latency when backend is hosted on Azure? API api	7	4618	November 3, 2023
Let's compare our API speed? It's too slow! API gpt-4 , gpt-35 , gpt-35-turbo , api , playground	9	1848	October 29, 2023

Is there an OpenAI API benchmark (speed ...)?

Related topics