I’m working on a project that requires multiple completions for a single prompt using the OpenAI API. I’m trying to understand how the token pricing works when requesting multiple completions (n>1) in a single API call. Specifically:
Are input (prompt) tokens charged once or multiple times based on the number of completions requested?
I’ve looked through the documentation but couldn’t find clear information on this specific topic. The documentation does say, “Note that you will be charged based on the number of generated tokens across all of the choices. Keep n as 1 to minimize costs.”