Long prompt completion request returning each token as a separate API response, along with 524 error

Jordy · May 11, 2023, 5:18am

Hi,

I am using Wized to retrieve data from the OpenAI API within Webflow. I am running into an issue where if I have a long prompt (~80 words) or set max_tokens to > ~500, then the API is returning what seems like one token at a time. This is happening regardless of whether I use /chat/completions or /completions, and for both GPT-3.5 and GPT-4 models.

Does this seem like a Wized issue or is this standard behaviour?

Here is an image of what is being returned if that helps - as you can see it is returning HTML which I asked for, but not in the conventional way (see example underneath for a shorter prompt working properly):

Any help appreciated!

Jordy · May 11, 2023, 5:18am

This is what a correct output should return:

Chriss4123 · May 11, 2023, 11:25am

Is the stream parameter set to false in the API request?

Topic		Replies	Views
Long empty response from chatgpt 3.5? API chatgpt , api	4	1459	December 18, 2023
Completions - Request failed with status code 400 API	1	1395	December 21, 2023
API is throttling response word count even with high token size API	0	18	February 6, 2025
Why do I get incomplete response and output Prompting	8	5574	December 19, 2023
GPT-4: ESOCKETTIMEDOUT error when calling the api API	8	1112	December 15, 2023

Long prompt completion request returning each token as a separate API response, along with 524 error

Related topics