Thank you for your reply!
Could you please explain your testing scenarios in more detail?
- Is the response time you measured an average computed over many runs?
- How many tokens of text input did you feed to GPT-4 Turbo? How does the number of input tokens affect the response time?