We are developing a tutoring platform and are using GPT4 Vision for it.
Due to the complexity of the tasks, the answer takes up to 2 minutes and the users find this super annoying.
On both platforms, the output is significantly faster and GPT starts writing the message after just a few seconds.
Do you know why that is?
Is there a way to get the answer in parts and start displaying the answer?