I’ve noticed that the current per-message token limit for the recently released o3 and o4-mini models is around 60-65K tokens in the chat app, which feels quite restrictive compared to previous (now retired) models that allowed up to roughly 100K tokens. Given that these models’ context window is advertised at around 200K tokens, I’m wondering whether this limitation is intentional or a bug.
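For anyone who wants to check how close a message is to that ceiling before pasting it, here’s a minimal sketch using tiktoken. It assumes the `o200k_base` encoding roughly matches what these models use, and `prompt.txt` is just a placeholder file name; the app may count or enforce limits differently.

```python
# Rough token count of a prompt before pasting it into the chat.
# Assumes the o200k_base encoding approximates what o3/o4-mini use;
# the in-app limit may be enforced differently.
import tiktoken

def count_tokens(text: str, encoding_name: str = "o200k_base") -> int:
    """Return the approximate token count for `text`."""
    encoding = tiktoken.get_encoding(encoding_name)
    return len(encoding.encode(text))

if __name__ == "__main__":
    with open("prompt.txt", "r", encoding="utf-8") as f:
        prompt = f.read()
    print(f"~{count_tokens(prompt)} tokens")
```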
I’m currently subscribed to ChatGPT Pro specifically for professional usage, and hitting this lower ceiling makes the tool almost unusable for my purposes. Worse, the macOS app has an even stricter limit, refusing pastes of more than around 30K tokens per message.
Could someone from OpenAI clarify whether these limits are permanent? If they are, I’ll have to reconsider the value of maintaining a Pro subscription, as these constraints seriously affect productivity.
This is a serious problem for me as well. It means splitting the code across multiple posts, which I’ve found reduces the model’s accuracy. It also burns through more of the 50-responses-per-week limit. I’ll honestly have to try other AI vendors if this can’t be resolved. Is there a reason we can’t have a single post that handles around 150K-175K tokens? That would still leave enough context for output responses.
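In the meantime, here’s a rough sketch of how I split a large file into chunks that each stay under a token budget, so each chunk can be pasted as a separate post. The 60,000 budget and the `o200k_base` encoding are my assumptions, not anything official.

```python
# Split a long file into chunks that each fit under a token budget,
# so each chunk can be pasted as a separate message.
# The 60_000 budget and o200k_base encoding are assumptions, not official limits.
import tiktoken

def split_by_token_budget(text: str, budget: int = 60_000,
                          encoding_name: str = "o200k_base") -> list[str]:
    encoding = tiktoken.get_encoding(encoding_name)
    chunks: list[str] = []
    current: list[str] = []
    current_tokens = 0
    for line in text.splitlines(keepends=True):
        line_tokens = len(encoding.encode(line))
        # Start a new chunk when the next line would exceed the budget.
        # Note: a single line longer than the budget still becomes its
        # own (oversized) chunk.
        if current and current_tokens + line_tokens > budget:
            chunks.append("".join(current))
            current, current_tokens = [], 0
        current.append(line)
        current_tokens += line_tokens
    if current:
        chunks.append("".join(current))
    return chunks

if __name__ == "__main__":
    with open("big_source_file.py", "r", encoding="utf-8") as f:
        parts = split_by_token_budget(f.read())
    print(f"{len(parts)} chunks")
```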
I’m experiencing the exact same issue. I’m also on the Pro plan, and right now it feels like even a 400-line code snippet gets cut off, which completely breaks the workflow.
What’s even more frustrating is that when I try to preview code in the interface, it won’t render more than roughly 200 lines, something previous models like o3-mini-high handled with no problem. This makes it really hard to use for any serious development work.
Considering this is a paid product that used to offer much more generous input and output lengths, these limitations feel like a step backward. If this is going to be the new standard, it seriously impacts the value of the Pro subscription.