Hello I’m sending this message: What is the most beautiful country?
I’m sending it as a json object {“role”:“user”,“content”:“What is the most beautiful country?”}
I thought it would return like 7 tokens for the prompt but it doesn’t.
It is returning 15 tokens for the prompt. Is that correct or it shouldn’t be returning that amount? Even when sending just a dot “.” as a message it is returning like 9 tokens for the prompt.
You can use max_tokens to limit the amount of tokens generated.
You can set a system message specifying the kind of response you want. e.g. terse, short, concise. OR specify how many tokens you want. e.g. “Answer in 5 tokens” in the system message.