Intermittent Multiple Responses in Single Output with gpt-5.2-chat-latest

I am reporting an issue where the gpt-5.2-chat-latest model intermittently generates multiple similar answers within a single API response. I am accessing the model via Azure OpenAI Service using LangChain.

Environment Details:

  • Platform: Azure OpenAI Service

  • Model Name: gpt-5.2-chat-latest

  • Framework: LangChain (ChatOpenAI / AzureChatOpenAI)

  • Method: Chat Completion

Observed Behavior:

  • Intermittency: The issue occurs intermittently rather than on every call; most requests return a single answer.

  • Multiple Answers: When the issue occurs, the single response string contains 2 to 4 distinct answers to the same prompt.

  • Content: The answers are near-duplicates in content and appear sequentially within the same output block.

Steps to Reproduce:

  1. Initialize the LangChain Azure OpenAI client with gpt-5.2-chat-latest.

  2. Send a user query.

  3. Observe that occasionally the returned string contains 2 to 4 similar responses combined.
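The steps above can be sketched against the raw Azure OpenAI REST interface (stdlib only, so it doubles as a minimal repro without LangChain in the loop). The endpoint, deployment name, API version, and key below are placeholders, not values from the original report:

```python
import json
import urllib.request


def build_chat_request(endpoint: str, deployment: str, api_version: str,
                       api_key: str, prompt: str):
    """Build the Azure OpenAI chat-completions request (URL, headers, body)."""
    url = (f"{endpoint}/openai/deployments/{deployment}"
           f"/chat/completions?api-version={api_version}")
    headers = {"Content-Type": "application/json", "api-key": api_key}
    body = json.dumps({
        "messages": [{"role": "user", "content": prompt}],
        "n": 1,  # explicitly request a single completion
    }).encode("utf-8")
    return url, headers, body


def ask(endpoint: str, deployment: str, api_version: str,
        api_key: str, prompt: str) -> str:
    """Send one chat completion and return the first choice's text."""
    url, headers, body = build_chat_request(
        endpoint, deployment, api_version, api_key, prompt)
    req = urllib.request.Request(url, data=body, headers=headers, method="POST")
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)["choices"][0]["message"]["content"]
```

Calling `ask(...)` repeatedly with the same query and logging responses that contain more than one answer is enough to reproduce the intermittent behavior; note that `n=1` is set explicitly, so multiple answers in one string are not explained by the request parameters.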

Sanitized Example (General Topic):
Below is an example of the behavior.

User Query:
“What are the benefits of drinking water?”

Model Response (Bugged):

### Benefits of Drinking Water
Staying hydrated is crucial for your health. It helps increase energy, improves skin complexion, and aids digestion. Remember to drink plenty of water daily.

### Benefits of Drinking Water
Water is essential for health. It boosts energy levels, keeps skin moisturized, and supports digestion. Aim for 8 glasses a day.

### Why Water is Important
Drinking water helps with energy, skin health, and digestion. It is vital for your well-being.

(Note: The model provides 2-4 similar answers in a row within a single response.)
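Because the duplicated answers in the sample above each begin with a markdown `### ` heading, a lightweight guard can flag affected responses before they reach users. A sketch; the heading-based splitting and the similarity threshold are assumptions based on the sanitized example, not part of the original report:

```python
import difflib
import re


def find_duplicate_answers(text: str, threshold: float = 0.6):
    """Split a response on markdown '### ' headings and report pairs of
    sections that are near-duplicates of each other.

    Returns a list of (heading_a, heading_b, similarity) tuples.
    """
    # Each chunk is one "answer": its heading line plus the body text.
    chunks = [c.strip() for c in re.split(r"(?m)^###\s+", text) if c.strip()]
    duplicates = []
    for i, a in enumerate(chunks):
        for b in chunks[i + 1:]:
            ratio = difflib.SequenceMatcher(None, a, b).ratio()
            if ratio >= threshold:
                duplicates.append((a.splitlines()[0], b.splitlines()[0], ratio))
    return duplicates
```

Running this on the bugged response above would report the repeated "Benefits of Drinking Water" sections; a non-empty result can trigger a retry or a log entry while the underlying issue is investigated.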

When you interact with an AI, the "thought process" usually happens behind the scenes. What you are describing, however, with several potential answers appearing in one output, sounds like a peek into the inference and streaming process. It could be parallel sampling or beam search, a streaming or latency problem, or UI/UX experimentation. It seems most probable that what was observed was a leak from a backend process in which the model generates several candidates in parallel. In the world of large language models, this is often linked to how the system evaluates the probability of a sequence of words.
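To illustrate the parallel-sampling idea: a backend may draw several candidate continuations, score each by the log-probability of its token sequence, and return only the best-scoring one. If the selection step is skipped or fails, every candidate leaks into the output, which matches the reported behavior. A toy sketch with a stand-in "model"; the phrasings and scores are invented for illustration:

```python
import math
import random


def sample_candidates(rng: random.Random, n: int = 4):
    """Stand-in for a model: each candidate is (text, per-token log-probs)."""
    phrasings = [
        "Water boosts energy and aids digestion.",
        "Staying hydrated improves energy, skin, and digestion.",
        "Drinking water supports energy, skin health, and digestion.",
        "Hydration is vital: energy, skin, digestion all benefit.",
    ]
    out = []
    for text in phrasings[:n]:
        # Fake per-token probabilities, one per whitespace-separated token.
        logps = [math.log(rng.uniform(0.2, 0.9)) for _ in text.split()]
        out.append((text, logps))
    return out


def pick_best(candidates):
    """Sequence score = sum of token log-probabilities; higher is better."""
    return max(candidates, key=lambda c: sum(c[1]))[0]


rng = random.Random(0)
cands = sample_candidates(rng)
best = pick_best(cands)
# Correct behavior: return only `best`.
# The reported bug resembles the failure mode where all candidates are
# concatenated into one response instead:
leaked = "\n\n".join(text for text, _ in cands)
```

The `leaked` string, several near-identical answers joined back to back, is structurally the same as the bugged response in the example above, which is why a backend candidate leak is a plausible explanation.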