When using gpt-3.5-turbo-0125, my prompt returns better results when top_p is left unset (i.e., it falls back to the default) than when it is explicitly set to 1, which is supposed to be the default value. I tested this on around 200 examples (to average out the nondeterministic behavior of GPT models), and leaving top_p unset gave roughly a 10% improvement on a categorization task. Has anyone had a similar experience?
The setups are:
response = client.chat.completions.create(
    model="gpt-3.5-turbo-0125",
    response_format={"type": "json_object"},
    seed=42,
    temperature=0,
    max_tokens=250,
    frequency_penalty=0,
    presence_penalty=0,
    messages=msgs,
)
vs.
response = client.chat.completions.create(
    model="gpt-3.5-turbo-0125",
    response_format={"type": "json_object"},
    seed=42,
    temperature=0,
    top_p=1,
    max_tokens=250,
    frequency_penalty=0,
    presence_penalty=0,
    messages=msgs,
)
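For context, the comparison itself is just scoring each run against the same labeled set and taking the difference. A minimal sketch of that scoring step (the labels below are placeholders for illustration, not my actual 200-example data):

```python
def accuracy(predictions, gold):
    """Fraction of predictions that exactly match the gold labels."""
    return sum(p == g for p, g in zip(predictions, gold)) / len(gold)

# Placeholder outputs for illustration only -- not real results.
gold        = ["billing", "tech", "billing", "other"]
run_default = ["billing", "tech", "billing", "other"]  # top_p unset
run_top_p_1 = ["billing", "tech", "other",   "other"]  # top_p=1

delta = accuracy(run_default, gold) - accuracy(run_top_p_1, gold)
print(f"accuracy delta: {delta:+.0%}")
```

Each run uses the same msgs per example, differing only in whether top_p=1 is passed.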