Max tokens of Azure OpenAI model

Hi,
I printed the following:

print(AzureOpenAI(deployment_name=xx, temperature=0.2))

It shows max_tokens as 256. But when I get the response, I see more than 1000 tokens. How is that possible? Does the model ignore max_tokens?

Thanks

Are you setting max_tokens? API Reference - OpenAI API

No, I am not; it is the default of 256 tokens.

No, the max_tokens default is infinite. The below is from the documentation:

max_tokens integer Optional Defaults to inf

The maximum number of tokens to generate in the chat completion.

The total length of input tokens and generated tokens is limited by the model’s context length.
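A quick sketch of what that limit means in practice. The context length and prompt size below are illustrative assumptions, not values from the original question:

```python
# If max_tokens is not set, the only hard cap is the model's context
# window (prompt tokens + generated tokens <= context length).
CONTEXT_LENGTH = 4097   # assumed context window, e.g. text-davinci-003
prompt_tokens = 900     # hypothetical prompt size

# The largest completion the model could generate for this prompt:
max_completion = CONTEXT_LENGTH - prompt_tokens
print(max_completion)   # 3197 — well over 256, so a >1000-token reply is possible
```

This is why a response can exceed 1000 tokens when no explicit max_tokens is sent: the effective ceiling is the context window, not 256.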

This is very confusing. The link below says it defaults to 16.

That is for Create Completion, not Chat. Scroll down further to the Chat section.

I am using completions (text-davinci) only, not chat. Davinci is completions, right?

So there are these two APIs:
https://api.openai.com/v1/models/davinci/completions → this one is for DaVinci completions.

https://api.openai.com/v1/chat/completions → this one is for when you want to simulate the experience you get from the ChatGPT bot. Remember, it's a full-fledged bot, so it supports back-and-forth interactions. Completions are more straightforward.

Either way, both need excellent prompts.
I learned this difference the hard way.
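To make the difference concrete, here is a hedged sketch of the two request payload shapes (field names follow the OpenAI API reference; the model names and values are just illustrative):

```python
# Completions endpoint: takes a single prompt string; max_tokens
# defaults to 16 here, so you usually set it explicitly.
completion_payload = {
    "model": "text-davinci-003",
    "prompt": "Say hello.",
    "max_tokens": 256,
}

# Chat endpoint: takes a list of role-tagged messages; max_tokens
# omitted here because for chat it is effectively unlimited,
# bounded only by the model's context length.
chat_payload = {
    "model": "gpt-3.5-turbo",
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Say hello."},
    ],
}

print("prompt" in completion_payload, "messages" in chat_payload)  # True True
```

Same API key, very different request shapes — which is also why their max_tokens defaults documented on the two reference pages differ.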

Maybe the number of tokens in your prompt payload exceeds 1000.
I ran into this once when I set max_tokens to 2048. In reality the constraint is prompt tokens + x <= 2048, where x is the number of completion tokens.
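That budget check can be sketched as a small pre-flight validation. The 2048 limit is the one mentioned above; in real code the prompt token count would come from a tokenizer (e.g. tiktoken), not a hard-coded number:

```python
# Hedged sketch: verify that the prompt plus the requested completion
# fits inside the model's context window before sending the request.
CONTEXT_LIMIT = 2048  # context window assumed in the post above

def fits(prompt_tokens: int, max_tokens: int, limit: int = CONTEXT_LIMIT) -> bool:
    """True if prompt tokens + requested completion tokens fit the window."""
    return prompt_tokens + max_tokens <= limit

print(fits(1000, 1048))  # True  — exactly fills the 2048-token window
print(fits(1200, 1000))  # False — over budget; the API would error or truncate
```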