GPT-4 not accepting more than 3,000 tokens even though a much higher limit is listed

Hello, I am having issues with GPT-4 when trying to generate larger outputs. When I request the full token amount (8,192 tokens), it returns a server error, but anything under 3,000 tokens works fine. I have tried very small prompts like “write a sentence” and it doesn’t matter; it always returns an error. Is there some limitation on token count right now?


Also, to clarify further: I have tried everything from 8K tokens all the way down to 3K tokens, and nothing in between works until I reach 3K tokens.

Get creative. “Write 3 paragraphs for 5 random topics, then repeat and generate 5 more topics until you reach 50 topics”


Surely you can do better than to bump two-month-old threads with advice unrelated to the issue.

This is clearly a case of the user not understanding that “max_tokens”, which limits the response, cannot be set as high as the model’s full context length, because that length is shared with the input during generation and is also required for the prompting provided, or that their application’s chat history was also consuming tokens.
Actually catching the API error would likely provide the answer.
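To make the arithmetic concrete, here is a minimal sketch (the 8,192-token window for GPT-4 is from the docs; the 5,000-token prompt is just an assumed example) of why an 8K `max_tokens` request fails once the prompt takes its share of the window:

```python
# The prompt and the completion share one context window, so
# max_tokens must leave room for the input tokens.
CONTEXT_WINDOW = 8192  # GPT-4's total context length

def max_completion_tokens(prompt_tokens: int,
                          context_window: int = CONTEXT_WINDOW) -> int:
    """Largest max_tokens value that still fits alongside the prompt."""
    remaining = context_window - prompt_tokens
    if remaining <= 0:
        raise ValueError("Prompt alone exceeds the model's context window")
    return remaining

# A hypothetical 5,000-token prompt leaves only 3,192 tokens for the
# completion, so requesting max_tokens=8192 would be rejected.
print(max_completion_tokens(5000))  # 3192
```

The API error message itself reports these numbers, which is why catching it usually answers the question directly.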


The issue is that GPT doesn’t keep any state/memory, so if you’re looking to create a complex response, you have to provide all of the system/user/assistant input every time. Otherwise, I’d love to get creative and overcome this issue.
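In practice, “no state” means every request must resend the whole conversation. A rough sketch of the bookkeeping (the message dicts follow the Chat Completions format; the helper and its contents are made up for illustration):

```python
# Because the API keeps no state, the client owns the conversation
# history and sends all of it with each request.
history = [{"role": "system", "content": "You are a helpful assistant."}]

def add_turn(history, user_text, assistant_text):
    """Append one completed user/assistant exchange to the history."""
    history.append({"role": "user", "content": user_text})
    history.append({"role": "assistant", "content": assistant_text})
    return history

add_turn(history, "Pick a topic.", "Let's discuss tokenizers.")

# The next request sends the full history plus the new user message,
# and every one of these messages counts against the context window.
next_request = history + [{"role": "user", "content": "Continue."}]
print(len(next_request))  # 4 messages go out, not just the latest one
```

This is also why a growing chat history silently eats into the token budget available for `max_tokens`.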


You replied to someone who didn’t understand the initial concern, and answered neither the original question nor improved on the misunderstanding of the poster you replied to.

Hi, as this topic seems to be active again,

How are you counting 3,000 tokens? Which of the GPT models are you using? Can you please include a code snippet of the API calling code and any setup it relies upon?

For clarity: due to the small overhead used by the API and other internal systems, it is advised to treat 1K in this context as 1000, not 1024. Also, if you wish to count tokens accurately, please use the tiktoken library, which can be found here: GitHub - openai/tiktoken: tiktoken is a fast BPE tokeniser for use with OpenAI's models. You can test your text’s token count with the newer cl100k_base encoding here: Tiktoken Web Interface cl100k_base

Yesterday was my first day on here, so please cut me some slack. You might have missed your calling as a litigator, which is part of what I do for a living. Are you an admin, or do you work for OpenAI?

I just found humor in my prior response about someone bumping an old thread being immediately applicable again…

You can see my answer provided earlier: the symptom the asker reported four months ago is similar to your recent issue, namely confusion over the use of max_tokens.

I, like all here, am just an enthusiast. For those browsing these topics in the future, it is useful to point out “not an answer” answers.

I appreciate the gratitude you’ve given in the other thread. The only compensation I get for sitting down and typing what couldn’t be ferreted out of documentation is knowing someone was helped.

Likewise, I want to see others helped usefully.

If you perceive terseness, it is because I don’t consider myself an expert, only experienced, while you might actually be the expert in certain areas. Approaching the conversation as I would with another programmer and developer like myself can also mean assuming you don’t need the basics or background, explanations which could otherwise come across as patronizing or condescending.

The AI doesn’t see any “broken English”… chat share

Here’s an emoji to let you know we are comrades :grin:


Thanks, I appreciate your help yesterday.



I am a Community Champion, I help champion the needs and pain points of developers so that OpenAI can optimise their time and resources. I do not work for OpenAI, I consult for clients who wish to understand how they might be affected by AI, and OpenAI’s services in particular.

Consider your slack, cut.


Thanks for your query. I thought that max_tokens included both the user’s prompt and the response because of how the OpenAI docs define it. J informed me that it covers just the output, so I wasn’t leaving enough context length for the prompt input. This was accurate, and I was able to resolve it by resetting the max_tokens parameter in the API call.