Can we get some clarification on tiers and latency?

It doesn’t seem to be reported as much as it was a year ago by users, when OpenAI made this dark pattern of degrading low-paying accounts even before they announced any tier system.

That “latency” language was stricken from documentation even when still affecting organizations.

There haven’t been many complaints recently by API users of targeted lower production rates. With “99% less expensive AI (for us) than two years ago” just spoken today at devday, there is likely less pressure to shuffle off particular developers to low-performance servers.

Which is good, as “pay the same per call as the rich, get less” is not a good company policy.

Scale tier is for buying $5000 increments of dedicated compute.

1 Like