The tokens-per-minute limit for tier 1 was slashed from 300k to 30k for gpt-4o and gpt-4-turbo.
This means a request to a 128k-context model with 28k+ input tokens and a reasonable max_tokens will fail outright.
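A minimal sketch of the arithmetic, assuming (as the failures above suggest) that the rate limiter counts the prompt tokens plus the requested max_tokens against the per-minute cap; the function name and token figures are illustrative, not from any official SDK:

```python
# Hypothetical illustration of the tier-1 cap described above.
TPM_LIMIT = 30_000  # assumed tier-1 tokens-per-minute limit

def fits_in_limit(input_tokens: int, max_tokens: int, limit: int = TPM_LIMIT) -> bool:
    """Assumption: the limiter counts input tokens plus requested max_tokens."""
    return input_tokens + max_tokens <= limit

# A 28k-token prompt with a modest 4k max_tokens already exceeds the cap:
print(fits_in_limit(28_000, 4_000))   # 32,000 > 30,000 -> False
```

Under this accounting, no single large-context request can ever succeed at tier 1, no matter how slowly you send requests.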
Assistants with retrieval or file_search also blow past this limit; users have reported failures there as well.
A user can’t spend $0.15 without first paying over $50, waiting, and paying again to have their limits recalculated?
Tier 1 can’t even queue a single 100k-token call to run overnight.
30,000 tokens per minute vs. 10 million for tier 5 has nothing to do with trust or server load.