Improve response time of GPT

We are developing a tutoring platform and are using GPT4 Vision for it.

Due to the complexity of the tasks, the answer takes up to 2 minutes and the users find this super annoying.

To test our prompts we use as well as

On both platforms, the output is significantly faster and GPT starts writing the message after just a few seconds.

Do you know why that is?
Is there a way to get the answer in parts and start displaying the answer?


It’s called streaming :slight_smile:

1 Like