Azure OpenAI streaming vs. OpenAI directly

Has anyone noticed a change in the behavior of Azure streaming compared to OpenAI? It seems that Azure has stopped streaming small chunks and is now streaming whole sentences. I am wondering why, because the usability is now worse than before…

cheers
Alex


SSE (server-sent events) can arrive in packet sizes or at time intervals of the provider's choosing, so your SSE client should be built to accept events that vary in size from a single character up to the full payload. Providers also often shape events for load management: under heavy load, or when the system is generating very quickly internally, events may be emitted at a steady pace so that no subsystem is overloaded.
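
For illustration, here is a minimal sketch of a streaming consumer using the openai Python SDK (v1-style client); the model name and prompt are placeholders, not anything from this thread. The point is that the loop makes no assumption about chunk size, so it behaves the same whether the server sends single tokens or whole sentences:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Placeholder model and prompt; substitute your own deployment/model.
stream = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write a short haiku about streaming."}],
    stream=True,
)

for chunk in stream:
    # Each event may carry anything from one character to several sentences;
    # simply append whatever arrives.
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
print()
```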


Yes, I noticed this problem too. With the OpenAI endpoint, the output appears word by word, which makes for a good user experience. With Azure, users often have to wait for a while and then a few paragraphs suddenly pop up, which feels much worse.

I wonder if there is a way to compensate for this or otherwise improve it?
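
One client-side workaround, sketched below (this is just an idea, not anything Azure provides), is to buffer the incoming deltas and re-emit them at a steady character rate, so that large chunks still render as a smooth typewriter effect. The trade-off is that the very end of the reply appears slightly later than it otherwise would:

```python
import sys
import time

def smooth_print(deltas, chars_per_second=60):
    """Re-emit streamed text at a steady pace, regardless of chunk size.

    `deltas` is any iterable of text fragments, e.g. the content pieces
    pulled out of a chat-completions stream.
    """
    delay = 1.0 / chars_per_second
    for delta in deltas:
        for ch in delta:
            sys.stdout.write(ch)
            sys.stdout.flush()
            time.sleep(delay)

# Example usage with simulated uneven chunks:
smooth_print(["Azure sometimes ", "sends several sentences in one event, ",
              "but this still prints character by character."])
```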

Might be something worth raising a ticket with Azure support about