What is the token-limit of the new version GPT 4o?

jens_strecker · May 15, 2024, 9:08am

Does anybody know what the token-limit of the new version GPT 4o is? Unfortunately, during my research on this topic, I keep finding different pieces of information.

Neoony · May 15, 2024, 9:27am

128k tokens
https://platform.openai.com/docs/models/gpt-4o
thats for API

chatGPT is probably 32k, however it seems they removed the context from chatgpt pricing page
https://openai.com/chatgpt/pricing/

so Iam not sure if its still 32k in chatGPT
EDIT: ah wait, just scroll down more

Also from what I have seen, the tokenizer is now completely different so something like this will no longer show it correctly for gpt4o
https://platform.openai.com/tokenizer

oh actually it now says “gpt4o coming soon”

_j · May 15, 2024, 10:28am

GPT-4o, like other recent models, will not allow you to produce more than 4k of output tokens however, and is trained to curtail its responses even more than that.

PeachyWasTaken · May 15, 2024, 6:07pm

Does it the limit reset after some time? I heard that it resets every 24h

jr.2509 · May 15, 2024, 6:15pm

Welcome to the Forum!

Are you referring to rate limits, such as RPM (requests per minute), RPD (requests per day), TPM (tokens per minute), TPD (tokens per day)?

_j · May 15, 2024, 6:42pm

Once you have paid the token amount to use the API, there are no daily limits. Just per-minute limits that far exceed that needed for several people. The gpt-4o model has the highest limits yet, letting you know how little computational impact the model has (the quality in anything but chatting also lets you know how little computation powers it…)

Model	Tier 1 Token limits	Tier 1 Request limits
gpt-4o	30,000 TPM	500 RPM

…

Model	Tier 5 Token limits	Tier 5 Request limits
gpt-4o	5,000,000 TPM	10,000 RPM

The first tier, paying $5, the rate is sufficient to empty your balance in 20 minutes if you ran a script.

The actual way the rate limit algorithm works can be described like:

You start with a minute’s worth of requests;
Your bucket refills at the continuous rate after use.

Although you can also get limited if you try to make all 500 RPM the same second.

jens_strecker · May 16, 2024, 7:45am

I asking in regards to context lenght (not API) but browser based using GPT 4o on: chatgpt com

jr.2509 · May 16, 2024, 7:53am

My response was to a different post in your thread Unfortunately, I am also not fully certain about the context length for the new model on the ChatGPT interface.

AlbertWesker · May 16, 2024, 11:01am

I disagree that the 500 tokens per minute “far exceed what’s needed for several people”, I just made a translator app for myself and if the person speaks for a long time, it cuts it off, and we’re talking like 1:30 or 2 mins of speech, nothing unrealistic. Whisper aces it, writes it down, sends it for translation and the translation comes back truncated if it goes over the 500 tokens per minute.

_j · May 16, 2024, 12:07pm

That’s 500 REQUESTS per minute. Over 8 per second.

If you are getting truncation, you should check the max_tokens API parameters and log any error messages you receive - the rate limit is not subtle, you get denied further API calls.

AlbertWesker · May 16, 2024, 5:07pm

you’re absolutely right, that was it, my bad

Andrew_C · May 20, 2024, 4:01am

I also looked up information to find out if it was advisable to stop paying, but on Reddit I found users who say they reached 7, 13, or 16 messages before it switched from GPT-4 to @ 3.5.

And asking chatGPT4o directly, he searched the internet and it says 80 every 3 hours for a payment plan. 160 for team membership

I think the variation in the free plan is due to “variations depending on availability”

I hope i’ve helped

muntasirzisan · May 25, 2024, 6:20am

I had a doubt that the Plus plan shows 5x more messages. But what is 5x more messages? Like if I have 5 messages limit per day then I will have 25? Just 25? Or is it like I will have 25 per chat and no reset time? or which plan has no reset time?

Thank you in advance

ccjarvis0011 · May 25, 2024, 6:22am

i bought the subscription based version, 37.99 NZD a month. I didnt know anything about it or request limits. i just wanted to have philosophical conversations and discuss random tidbits. Do i have some kind of limit per day? i did use a lot of requests…?

_j · May 25, 2024, 6:31am

As mentioned right above the last two replies, the evaluation period for rate-limiting ChatGPT is three hours.

If you were given 80 message per 3 hours, then you’d not be able to receive an answer to question 81 until question 1 had expired by being asked over three hours ago.

muntasirzisan · May 25, 2024, 6:59am

So is it that there is only 3 hours limit from starting the first question?

And is it 80 chats or 80 questions in 1 chat?

_j · May 25, 2024, 7:06am

Well, any question. If you asked 40 GPT-4 questions over a four-hour period, the initial 10 you might have asked that are past hour three no longer count against the limit. A continuous rate of one input per 4.5 minutes. With GPT-4o counting against the limit half as much.

Per account.

muntasirzisan · May 25, 2024, 7:35am

Is it for ChatGPT Plus or Team? which one has more chats

_j · May 25, 2024, 8:30am

Teams is a higher tier, but requires at least two paid seats at the higher cost, and doesn’t have the required management to prevent users from just adding more users to a company.

Here’s where current and any changing limits would be: https://help.openai.com/en/articles/6950777-what-is-chatgpt-plus

vupham · July 4, 2024, 4:59am

I’m currently using the GPT-4 API with a 4K token limit, as confirmed in the Playground. How can I increase the maximum token count to 128K?

Topic		Replies	Views
GPT 4 Turbo is limited to 4K? API gpt-4	16	13954	April 9, 2024
Gpt-4-1106-preview in Playground needs some fixes API gpt-4 , playground	24	17126	February 5, 2024
Test new 128k window on gpt-4-1106-preview API	29	18266	February 6, 2024
Is the "output (Maximum length)" for the GPT-4-1106-preview API still capped at 4095? API gpt-4 , gpt-4-turbo	3	7527	November 15, 2023
4096 response limit vs 128 000 context window API	11	11299	February 6, 2025

What is the token-limit of the new version GPT 4o?

Related topics