temperature, top_p and n are fixed at 1, while presence_penalty and frequency_penalty are fixed at 0.
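In practice that means requests to these models just leave the sampling parameters at their defaults. A minimal sketch, assuming the official OpenAI Python client and an illustrative reasoning-model name ("o1-mini" here, not taken from the quoted docs):

```python
# Minimal sketch: calling a reasoning model without touching the sampling
# parameters, since temperature / top_p / n are fixed at 1 and the two
# penalties at 0 for these models. Model name is illustrative only.
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="o1-mini",  # illustrative reasoning-model name, an assumption
    messages=[{"role": "user", "content": "Refactor this function to be iterative."}],
    # temperature, top_p, n, presence_penalty, frequency_penalty are omitted:
    # per the docs above they are fixed at 1 / 1 / 1 / 0 / 0, so there is
    # nothing useful to override here.
)
print(response.choices[0].message.content)
```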
Why are temperature and top_p set to 1? This is a reasoning model, not a creative model. Wouldn't setting temperature and top_p high increase the likelihood of hallucinations and of sampling tokens that lead to less likely, less accurate outcomes?
For me, that's not just a theoretical prediction about how those values change the output. In my experience across gpt-3, 3.5, 4, 4o, turbos, claude, phi, mistral, llamas, in eval environments, they all produce the best code (in terms of quality and of sticking to the instructions) when temperature is 0 and top_p is very close to 0. I don't mind non-creative and repetitive responses.
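To make the mechanics concrete, here is a toy sketch (not any vendor's actual implementation) of how temperature and top_p act on next-token logits, showing why pushing either one toward 0 collapses sampling onto the single most likely token:

```python
# Toy illustration of temperature scaling and nucleus (top_p) sampling.
import numpy as np

def sample_token(logits, temperature=1.0, top_p=1.0, rng=np.random.default_rng()):
    if temperature <= 1e-6:
        # Temperature ~0: greedy decoding, always the argmax token.
        return int(np.argmax(logits))
    # Temperature scaling: divide logits before softmax.
    probs = np.exp(logits / temperature)
    probs /= probs.sum()
    # Nucleus (top_p) filtering: keep the smallest set of tokens whose
    # cumulative probability reaches top_p, then renormalize and sample.
    order = np.argsort(probs)[::-1]
    cum = np.cumsum(probs[order])
    keep = order[: int(np.searchsorted(cum, top_p)) + 1]
    kept = probs[keep] / probs[keep].sum()
    return int(rng.choice(keep, p=kept))

logits = np.array([4.0, 3.5, 1.0, 0.5])                   # toy next-token scores
print(sample_token(logits, temperature=0.0))               # always token 0
print(sample_token(logits, temperature=1.0, top_p=0.05))   # nucleus collapses to token 0
print(sample_token(logits, temperature=1.0, top_p=1.0))    # any token can be drawn
```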
Please help me understand that choice for default temp and top_p.
Possible, although there have been temperature-looking problems reported when prompting in other languages and getting oddball third-language tokens back, so the creativity might be going to the wrong place.
Then, if you want variations to pick a best response from, temperature also matters, as with the best_of API parameter on the completions endpoint (which selects by total logit probability rather than using an AI judge). best_of: 10 is wasting your money if there's no sampling variety.
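A minimal sketch of that point, assuming the OpenAI Python client and the legacy completions endpoint (the model name is illustrative): best_of generates candidates server-side and keeps the one with the highest total log probability, so it only buys you anything if temperature/top_p let the candidates differ in the first place.

```python
# best_of on the legacy Completions endpoint: with temperature 0 all ten
# candidates would be identical, so the extra generations are wasted.
from openai import OpenAI

client = OpenAI()

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",  # illustrative completions model, an assumption
    prompt="Write a one-line docstring for a function that reverses a list.",
    max_tokens=40,
    temperature=0.8,  # some sampling variety, so the candidates actually differ
    best_of=10,       # server generates 10 candidates, keeps the most probable
    n=1,              # only that best candidate is returned to you
)
print(response.choices[0].text)
```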