I've been doing speedtests, is this "normal"?

manyapps · May 21, 2023, 9:25am

We’ve been getting mixed results on API request speed, so I started logging how long each step in our scripts takes. For example, there are separate logs for fetching the vectorized content from our database, for getting the answer from our embedded context, etc.
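For anyone who wants to log in a similar way, here is a minimal sketch of per-step timing. It is not our actual code: the step names (`vector_lookup`, `chat_answer`), the `timed` helper and the `handle_request` function are made up for illustration, and the `time.sleep` calls stand in for the real database lookup and the real API call.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Durations in seconds, keyed by step name (the names are hypothetical).
timings = defaultdict(list)


@contextmanager
def timed(step):
    """Record how long the wrapped block takes under the given step name."""
    start = time.perf_counter()
    try:
        yield
    finally:
        # Runs even if the step raises, so failed calls are still logged.
        timings[step].append(time.perf_counter() - start)


def handle_request(question):
    # Placeholder steps: swap the sleeps for the real database and API calls.
    with timed("vector_lookup"):
        time.sleep(0.01)
    with timed("chat_answer"):
        time.sleep(0.02)
    return "answer"


handle_request("example question")
for step, durations in timings.items():
    avg = sum(durations) / len(durations)
    print(f"{step}: {avg:.3f}s avg over {len(durations)} call(s)")
```

Timing each step on its own makes it possible to tell the time spent on our end apart from the time spent waiting on the remote API.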

Here’s a graph from our past ~1000 requests. There’s a bit of speed to be gained on our end (working on optimizing that), but as far as I know, there’s not much we can do to speed up the part that happens on OpenAI’s side.

Are you seeing similar speeds when using the embedding method?
(If you need more detailed views, let me know and I’ll edit the graph data.)

This is using the API, GPT-4 for chat (not completions).

ThioJoe · May 21, 2023, 5:39pm

Interesting. I have nothing of substance to add, but it might be neat to graph the average for different times of day across multiple days, to see what effect peak times have.
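A rough sketch of what that grouping could look like, assuming each logged request is available as a (timestamp, latency in seconds) pair. The function name `mean_latency_by_hour` and the sample values are invented for illustration and are not measurements from this thread.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def mean_latency_by_hour(records):
    """Average latency per hour of day (0-23), pooled across all days."""
    buckets = defaultdict(list)
    for timestamp, seconds in records:
        buckets[timestamp.hour].append(seconds)
    return {hour: mean(values) for hour, values in sorted(buckets.items())}


# Made-up sample data: (request time, total latency in seconds).
sample = [
    (datetime(2023, 5, 20, 9, 5), 4.0),
    (datetime(2023, 5, 21, 9, 40), 6.0),
    (datetime(2023, 5, 21, 17, 15), 9.0),
]
print(mean_latency_by_hour(sample))
```

Plotting the resulting hour-to-average mapping as a bar chart would show whether some hours are consistently slower. Timestamps should be converted to a single time zone first so the hours line up.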
