Questions on setting n and max_token

fishorse · March 20, 2024, 9:42am

What is the difference between setting the parameter n and sending the same requests n times? e.g., if I set n=5, and get 5 choices, how does this differ from I send the request 5 times?
Will max_token impact the length of the output, besides cutting it when it exceeds the limit? e.g., if I set max_token to a small number versus a large number, will the response I get differ in the length significantly?

Thanks!

Diet · March 20, 2024, 9:47am

Welcome to the community!

I think if you use the n parameter you only pay for the input tokens once. if you make 5 calls, you pay 5 times for both output and input. But I could be wrong, documentation on that is becoming spotty. The utility of that is pretty limited.

Nope. It will just cut it off. It has absolutely no bearing on the quality of the generation.

fishorse · March 20, 2024, 10:08am

Thank you! I get the cost part of setting n. But will it impact the quality / similarity of the responses I get, assuming all the other parameters are the same?

Diet · March 20, 2024, 11:01am

It’s a tough question.

Generally not really, although you may get different model fingerprints with different calls, giving you slightly different results. https://platform.openai.com/docs/api-reference/chat/create#chat-create-seed

I wouldn’t worry about it.

vb · March 20, 2024, 11:11am

Your first question has been answered and explored here:

This is helpful when you want to check if the model replies according to your expectations. One can send the same request 10- 10,000 times and evaluate the replies, assessing if the deviation from the required reply quality is 1% or 5%.

That’s going to save time and money.

Topic		Replies	Views
How does `n` parameter work in chat completions? API gpt-35-turbo , chatgpt , api	12	12793	December 10, 2023
Do I need to increase `max_tokens` when using `n>1` e.g. `n=3` for generating multiple chat completions API	8	2052	July 2, 2023
Is the max_tokens parameter of the completions endpoint applicable for ALL or EACH response? API	7	2239	July 3, 2023
Question regarding max_tokens Prompting	11	37303	December 13, 2023
MAX TOKENS is 4,096 tokens for gpt-3.5-turbo should fit the the messages sent and the answer generated? API api	10	6139	December 18, 2023

Questions on setting n and max_token

Related topics