On it! Thanks for the feedback.
Hey Bill, awesome, thanks for the explanation. You mentioned PaLM 2. I searched and noticed it's available through the Google Vertex AI API, right?
I'm having an issue trying to translate text from English to Portuguese. PaLM 2 is supposed to work in Portuguese, but when I try to translate, I get the following message:
I am trained to understand and respond only to a subset of languages at this time and can't provide assistance with that.
So now my question is whether this LLM really works in Portuguese.
Bard uses PaLM 2. Try your prompt there to see how it responds.
The best thing you could do to “speed up” the API calls for your users is to stream the output rather than waiting for it to complete.
Just like using the ChatGPT interface. The query may take a while, but from the user's perspective they see it spitting out word by word, which is a better experience.
How would I be able to do that just from the API call?
Agree with Justin here – the streaming option works great from a UX perspective. So rather than trying to speed up the API call (which probably won't happen even if you switch LLMs), change the UX if you can.
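To make the streaming idea concrete, here's a minimal Python sketch of the pattern: instead of waiting for the full completion, the client consumes the response chunk by chunk and renders each piece as it arrives. The `fake_completion_stream` generator below is a stand-in for a real streaming API response (most LLM APIs expose this via a `stream`-style option that yields partial output); all names here are illustrative, not a specific provider's API.

```python
def fake_completion_stream():
    # Stand-in for a streaming LLM response: yields one token at a time,
    # the way a real streaming endpoint delivers partial output.
    for token in ["The", " answer", " is", " 42", "."]:
        yield token

def stream_to_user(chunks):
    # Display each chunk the moment it arrives, so the user sees
    # word-by-word output instead of one long wait for the full reply.
    text = ""
    for chunk in chunks:
        print(chunk, end="", flush=True)  # render incrementally
        text += chunk
    print()
    return text

result = stream_to_user(fake_completion_stream())
```

The total time to the last token is unchanged; what improves is time-to-first-token, which is what the user actually perceives.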
Hey, first of all, thanks for opening my eyes. How did you build a dashboard like this?
Coda.
Ah ok, thanks very much. Can I ask what software you used for the vector embeddings and in-memory caches?
Yes - I created vectors using OpenAI in an automated process with Google Apps Script. The resulting vectors are stored in a spreadsheet. The script uses a dot product function to perform similarity queries, which I can perform internally to the script itself or as a web service.
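The similarity-query step described above can be sketched in a few lines: store one embedding vector per document, then rank documents by their dot product against the query's vector. This is a Python illustration of the idea, not the actual Apps Script; the tiny 3-dimensional vectors are made up for the example (real embeddings from an API would have hundreds or thousands of dimensions, and for unit-length embeddings the dot product equals cosine similarity).

```python
def dot(a, b):
    # Dot product of two equal-length vectors.
    return sum(x * y for x, y in zip(a, b))

def most_similar(query_vec, stored):
    # stored maps doc id -> embedding vector (e.g. rows of a spreadsheet).
    # Returns doc ids ranked from most to least similar to the query.
    return sorted(stored, key=lambda doc_id: dot(query_vec, stored[doc_id]),
                  reverse=True)

# Hypothetical example data: two stored documents and one query vector.
docs = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.1, 0.8, 0.1],
}
query = [1.0, 0.0, 0.0]
ranking = most_similar(query, docs)  # doc_a scores 0.9, doc_b scores 0.1
```

Serving this as a web service is then just a matter of wrapping `most_similar` behind an HTTP endpoint that embeds the incoming query first.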
