Right now I’m seeing anywhere from 30 seconds to a minute on response times for the GPT-4 API with prompt tokens of ~1500 and response tokens of ~400. Does anyone know when we can expect to see these times start to speedup to sub 30 seconds? I have found little information on this on the web and other forums.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
GPT 4 API is Very Slow Still | 15 | 6633 | December 16, 2023 | |
GPT 4 API taking more time to render things asked through prompts | 1 | 522 | September 14, 2023 | |
Gpt-4-0125-preview is slower than gpt-4-0613? | 5 | 5503 | January 30, 2024 | |
When will the response time/timeout issue be addressed? | 1 | 1361 | November 2, 2023 | |
Gpt-4-0125-preview INCREDIBLY slower than 3.5 turbo | 12 | 9429 | July 22, 2024 |