The 4k token limit refers to the output token limit, which is the same across all of the latest models.
The 128k, on the other hand, refers to the total token limit (or context window), which is shared between the input and output tokens. Both gpt-4-turbo models and gpt-4o have a 128k context window, while the original gpt-4 has an 8k token limit.
Thanks for clarifying. I see in the screenshot below that my maximum context window is only 4096 tokens, not 128k. I believe this might depend on the account type. Do you know how to upgrade to the 128k token version? Perhaps contacting OpenAI support would be helpful?
As said, "maximum tokens" here refers to the output tokens. This is the same across all of the latest models, as also shown in the model overview here.
To give you an example: if you are trying to get the model to produce an output of 1,000 tokens, you could include up to 127k input tokens as part of your system and user messages. The sum of the input and output tokens cannot exceed 128k. However, the output - at this point in the model evolution - is always limited to 4k tokens.
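The budgeting rule above can be sketched in a few lines of Python. This is just an illustration of the arithmetic, not an official API; the constants match the 128k/4k figures discussed in this thread, and `max_input_tokens` is a hypothetical helper name:

```python
# Illustrative sketch: the total context window is shared between input
# and output, and the output is additionally capped per request.
CONTEXT_WINDOW = 128_000  # e.g. gpt-4-turbo / gpt-4o total window
MAX_OUTPUT = 4_096        # output cap on these models

def max_input_tokens(desired_output: int) -> int:
    """Return how many input tokens fit alongside the desired output."""
    if desired_output > MAX_OUTPUT:
        raise ValueError(f"Output is capped at {MAX_OUTPUT} tokens")
    return CONTEXT_WINDOW - desired_output

print(max_input_tokens(1_000))  # 127000 input tokens still available
```

So requesting a 1,000-token answer leaves roughly 127k tokens of room for your system and user messages, but asking for a 10,000-token answer is rejected outright because it exceeds the output cap.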
Bottom line: There is no upgrade and there is no need to reach out to OpenAI support.
The screenshot I showed you highlights the "total context window" setting, not the output.
For instance, my other model, the 3.5 Turbo 16k, has a token limit of 16,000; the screenshot below shows the context window setting.
I used Poe, which offers both GPT-4o (with a 4,000 token context window) and GPT-4o-128k (with a 128,000 token context window).
I can't speak specifically to the Poe account you are referencing - however, the information / advice provided to you remains unchanged.
The discrepancy in token limits you are seeing in the Playground is likely a bug. You can rely on the information in the model overview at the link I shared, which specifies all relevant details for each individual model.
I was waiting for this answer for SEVERAL hours. Tyvm. Do you know if ScholarGPT ( ChatGPT - Scholar GPT ) uses GPT-4 or GPT-4o? I really need this answered so I can move on to my research project.
I need to know ScholarGPT's context window and GPT-4o's context window (and, if possible, GPT-4o mini's).
That's great news, actually! I was considering whether to use Scholar GPT on the web for a research project. A 16k token cap works for me. This will actually make my job like 10 times easier, no joke.
I'm confused about what you mean by "manually" (sorry, I'm just starting to learn about AI). Is it the normal limit in the web browser, without the API or anything similar?
I have no idea what language you're speaking. Is there any place that explains in layman's terms what "128k context length" means in plain English? Does it have to do with how many characters I can use in my questions?
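In plain terms: models count text in tokens, not characters, and a context length is the total number of tokens (your question plus the answer) the model can consider at once. A commonly cited rule of thumb is that one token is roughly 4 characters of English text. The sketch below uses that heuristic only as a rough illustration - `approx_tokens` is a hypothetical estimator, not how models actually tokenize:

```python
# Rough rule of thumb: ~4 characters of English text per token.
# Real tokenizers split text differently; this is only an estimate.
def approx_tokens(text: str) -> int:
    """Estimate the token count of a string using the ~4 chars/token heuristic."""
    return max(1, round(len(text) / 4))

question = "What is a context window?"
print(approx_tokens(question))  # this 25-character question is ~6 tokens
```

By that rough measure, a 128k-token context length corresponds to something on the order of half a million characters of combined question-and-answer text, so ordinary questions come nowhere near the limit.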