I’m looking for ideas/feedback on how to improve the response time with GPT-Vision. Does anyone know how any of the following impact response times:
- System message length (e.g. 2 sentences vs 4 paragraphs)
- Image size
- [Low or high fidelity image understanding](https://platform.openai.com/docs/guides/vision/low-or-high-fidelity-image-understanding) via the `detail` parameter
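For concreteness, here's a minimal sketch of a request payload that exercises two of these levers (a short system message and `detail: "low"`). The model name, image URL, and prompt are placeholders, not anything from our actual setup:

```python
def build_vision_request(image_url: str, prompt: str) -> dict:
    """Build a chat-completions payload with low-fidelity image input."""
    return {
        "model": "gpt-4-vision-preview",  # placeholder model name
        "max_tokens": 200,  # capping output length also bounds latency
        "messages": [
            # Keep the system message short -- a couple of sentences
            # rather than several paragraphs.
            {"role": "system", "content": "You are a concise image analyst."},
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        # "low" processes the image at reduced resolution,
                        # which consumes fewer input tokens than "high".
                        "image_url": {"url": image_url, "detail": "low"},
                    },
                ],
            },
        ],
    }

payload = build_vision_request("https://example.com/photo.jpg", "What is shown?")
```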
Are there any other considerations/learnings for faster response time?
For context, in our UI the response can take anywhere from 5-15+ seconds. We have a long system message (~4-5 paragraphs), a relatively small image size, and the `detail` parameter set to `low`. The response token length varies but is usually somewhere between 50-200.