Is there a way to prevent the gpt-3.5-turbo API from returning content in chunks?

I have been testing the gpt-3.5-turbo API, and my cURL requests all come back in chunks, with each word of the response content in a separate chunk. Is there a way to have it all arrive at once? Thanks

I’m not sure I understand. Is it not returning this, as the docs describe?

{
  'id': 'chatcmpl-6p9XYPYSTTRi0xEviKjjilqrWU2Ve',
  'object': 'chat.completion',
  'created': 1677649420,
  'model': 'gpt-3.5-turbo',
  'usage': {'prompt_tokens': 56, 'completion_tokens': 31, 'total_tokens': 87},
  'choices': [
    {
      'message': {
        'role': 'assistant',
        'content': 'The 2020 World Series was played in Arlington, Texas at the Globe Life Field, which was the new home stadium for the Texas Rangers.'},
      'finish_reason': 'stop',
      'index': 0
    }
  ]
}

Edit: Looks like the stream option is what you want.

Check whether the stream parameter is set to true in your request.

If you want the whole response at once, set it to false.
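As a sketch of what that looks like in practice, here is the request body for a non-streamed call, shown in Python for clarity (the question and messages are placeholders; the payload would be POSTed to the standard Chat Completions endpoint with your API key):

```python
import json

# Request body for a single, non-streamed completion.
# With stream set to False (the default), the API returns one
# complete JSON object instead of a series of SSE "data:" chunks.
payload = {
    "model": "gpt-3.5-turbo",
    "messages": [
        {"role": "user", "content": "Where was the 2020 World Series played?"}
    ],
    "stream": False,  # request the full response in one piece
}

body = json.dumps(payload)
```

The equivalent cURL request would pass this JSON with -d (and an Authorization header) to https://api.openai.com/v1/chat/completions; omitting "stream" entirely has the same effect, since it defaults to false.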


Thanks, I’ve looked everywhere but can’t find the stream option. Do you have any pointers to where it is? Thanks!!


OpenAI API: Chat, create