How does `n` parameter work in chat completions?

geoff3 · August 16, 2023, 8:45pm

Agreed, today for me, it seems that RateLimitError computes token count as prompt length * N (I get this error when setting large N, e.g. n=100). But if I submit a smaller value (say n=10) and look at the chat.completion object under “usage”, it computes prompt_tokens and total_tokens as expected (described in the accepted answer)

Topic		Replies	Views
Questions on setting n and max_token API	4	911	March 20, 2024
Clarification on token pricing for multiple completions (n>1) in a single API call" API pricing	1	418	July 3, 2024
Do I need to increase `max_tokens` when using `n>1` e.g. `n=3` for generating multiple chat completions API	8	2065	July 2, 2023
Is the max_tokens parameter of the completions endpoint applicable for ALL or EACH response? API	7	2276	July 3, 2023
Multiple prompt responses everywhere API	6	3644	December 25, 2023

How does `n` parameter work in chat completions?

Related topics