O3 is 80% cheaper and introducing o3-pro

I think you mean “minutes”, not “ms”.

A simple request to o3-pro with a single image as input, billed for only a small number of tokens, was still an extensive wait.
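
Roughly what that looks like, for reference: a minimal sketch (not my actual script) of a one-image request to o3-pro via the Responses API with the wall-clock wait measured; the prompt and image URL are placeholders.

```python
import time
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

start = time.monotonic()
response = client.responses.create(
    model="o3-pro",
    input=[{
        "role": "user",
        "content": [
            {"type": "input_text", "text": "Describe this image in one sentence."},
            {"type": "input_image", "image_url": "https://example.com/photo.jpg"},
        ],
    }],
)
elapsed = time.monotonic() - start

print(f"Waited {elapsed / 60:.1f} minutes")
print(response.output_text)
print(response.usage)  # token counts behind the (small) bill
```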

I suspect there's no flex processing discount because that is effectively already happening via the “inference efficiencies” also coming to this model: if you’re willing to wait a long time for a response, you’re also transparently waiting behind other jobs while your API call is fitted into a queue…
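
For comparison, a hedged sketch of what opting into flex processing looks like on models that do offer it (per OpenAI's flex docs for the Chat Completions API): you pass service_tier="flex" and raise the client timeout, accepting queue-like scheduling in exchange for the discount. Whether o3-pro ever gets this tier is the open question; "o3" below is just a model documented to support it.

```python
from openai import OpenAI

client = OpenAI(timeout=900.0)  # allow up to 15 minutes per request

response = client.chat.completions.create(
    model="o3",
    messages=[{"role": "user", "content": "Summarize the trade-offs of queue-based batch inference."}],
    service_tier="flex",  # discounted tier: request may wait behind other jobs
)

print(response.choices[0].message.content)
```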