Observing discrepancy in completions with temperature = 0

AgusPG · February 24, 2023, 2:05pm

This is a very interesting question that has been around for some time. In my view, the most comprehensive answer was given here: A question on determinism

Even though you are right in your hypothesis @sam_nabla, it doesn’t hold empirically. Even with a greedy decoding strategy, small discrepancies regarding floating point operations lead to divergent generations. In simpler terms: when the top-two tokens have very similar log-probs, there’s a non-zero probability of choosing the least probable one due to the finite number of digits that you’re using for multiplying probs and storing them.

It should also be noted that, as the decoding occurs in an autoregressive way, once you have picked a different token the whole generated sequence will diverge, as this choice affects to the probability of generating every subsequent token.

Hope that helps

Topic		Replies	Views
Why is GPT-4 giving different answers with same prompt & temperature=0? API	6	16607	April 6, 2023
Why the API output is inconsistent even after the temperature is set to 0 API gpt-4	11	24046	December 21, 2023
Run same query many times - different results API	11	8155	December 21, 2023
ChatCompletions are not deterministic even with seed set, temperature=0, top_p=0, n=1 API gpt-4 , api	9	1798	October 7, 2024
Possible bug? gpt-3.5-turbo non-deterministic even with temperature zero API	4	4609	December 21, 2023

Observing discrepancy in completions with temperature = 0

Related topics