How to improve the speed of the ChatGPT API calling?


I am using ChatGPT API to translate texts.

For a sample prompt like below:

Please translate the following English contents to Spanish. Do not change or remove the HTML tags. Also keep the double quotation marks around the contents. "<a href="" target="_blank" rel="noopener">Bank of America</a>, a <a href="" target="_blank" rel="noopener">Fortune 500</a> and one of the leading global banking institutions, found itself facing a significant data issue in 2016. Due to a system malfunction during an update, a massive volume of critical <a href="" target="_blank" rel="noopener">PDF documents</a> became corrupted, rendering them unreadable. This case study delves into how <a href="" target="_blank" rel="noopener">DataNumen PDF Repair</a> helped Bank of America swiftly recover from this predicament, safeguarding their operations, customer relations, and compliance obligations."

gpt4 will take 23s to translate it.
gpt3.5 turbo will take 7s to translate it.

Is there a way to improve the speed? I see a lot of models, which will provide the fastest result meanwhile keep the quality of the result?


For translation, I would not use anything less than GPT 3.5. The speed is down to the complexity of the model and the sheer number of people using it, if you were to require 450million tokens per day then you could find an economic argument for having your own dedicated instance, but that would be a sizable target to reach.