GPT-4, likely being a mixture of expert models whose results are synthesized, behaves somewhat differently from the gpt-3.x models. One interpretation is that temperature is scaled differently within specializations, or that some components still perform multinomial selection.
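For intuition, here is a minimal sketch of temperature-scaled softmax sampling followed by a multinomial draw (not GPT-4's actual sampler; the logits are invented): at temperature 0.2, even a small logit gap lets one token dominate.

import numpy as np

rng = np.random.default_rng()

def sample_token(logits, temperature):
    # Temperature divides the logits before softmax: low values sharpen
    # the distribution toward the top token, high values flatten it.
    scaled = np.array(logits, dtype=float) / max(temperature, 1e-6)
    probs = np.exp(scaled - scaled.max())
    probs /= probs.sum()
    # Multinomial draw over the rescaled distribution.
    return rng.choice(len(probs), p=probs)

logits = [2.0, 1.7]  # hypothetical logits: index 0 = 'tails', 1 = 'heads'
flips = [sample_token(logits, temperature=0.2) for _ in range(60)]
print(''.join('th'[i] for i in flips))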
The assertion that the tool is broken because outputs still vary even at very low temperature is specious.
Let’s play with probabilities:
import openai  # pre-1.0 openai-python library

params = {
    "model": "gpt-4", "max_tokens": 1, "n": 60,
    "temperature": 0.2, "top_p": 0.99,
    "messages": [{"role": "system",
                  "content": """Allowed output: only one word, a random choice of 'heads' or 'tails'.
Flip a virtual coin with equal outcome probability."""}],
}
api = openai.ChatCompletion.create(**params)
# Take the first character of each of the 60 completions: 'h' or 't'.
flips = ''.join(choice["message"]["content"][0] for choice in api["choices"])
print(flips)
Sixty completions of an identical gpt-4 request, on a task far more uncertain than writing code. The results (h = heads, t = tails):
ttttttttttttttttttthtttttttttttthtttthtththttttttttttttthttt
(it is actually quite hard to prompt for equal probabilities without getting token logits back)
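If you want to actually see the probabilities, the legacy completions endpoint does return logprobs; a sketch, assuming the pre-1.0 library and a completions-capable model such as gpt-3.5-turbo-instruct (gpt-4 is not served on that endpoint):

import openai

resp = openai.Completion.create(
    model="gpt-3.5-turbo-instruct",
    prompt="Flip a virtual coin with equal outcome probability. One word, 'heads' or 'tails':",
    max_tokens=1,
    temperature=0.2,
    logprobs=5,
)
# Top candidate tokens and their log probabilities for the first position.
print(resp["choices"][0]["logprobs"]["top_logprobs"][0])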
I’m going to crank temperature up to 2, but drop top_p to 0.
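Only the two sampling parameters change; a sketch reusing the params dict from above:

# Same request as before, only the sampling parameters differ.
params["temperature"] = 2
params["top_p"] = 0
api = openai.ChatCompletion.create(**params)
print(''.join(choice["message"]["content"][0] for choice in api["choices"]))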
tttttttttttttttttttttttttttttttttttttttttttttttttttttttttttt
tttttttttttttttttttttttttttttttttttttttttttttttttttttttttttt
tttttttttttttttttttttttttttttttttttttttttttttttttttttttttttt
Deterministic coin flips.
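That matches how nucleus sampling is commonly implemented: top_p keeps the smallest set of tokens whose cumulative probability reaches the threshold, and at least the single most probable token is always retained, so top_p of 0 collapses the pool to that one token and temperature no longer matters. A sketch with made-up probabilities:

import numpy as np

def top_p_filter(probs, top_p):
    # Keep the smallest set of tokens whose cumulative probability reaches
    # top_p; at least the top token is always kept, so top_p=0 is greedy.
    order = np.argsort(probs)[::-1]
    cumulative = np.cumsum(np.asarray(probs)[order])
    keep = order[:max(1, np.searchsorted(cumulative, top_p) + 1)]
    filtered = np.zeros_like(np.asarray(probs, dtype=float))
    filtered[keep] = np.asarray(probs)[keep]
    return filtered / filtered.sum()

probs = [0.55, 0.45]                 # hypothetical 'tails' vs 'heads'
print(top_p_filter(probs, 0.0))      # [1. 0.] -- only 'tails' survives
print(top_p_filter(probs, 0.99))     # [0.55 0.45] -- both stay in the pool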