One apparent benefit of o3-pro: at least you aren't the one paying for hundreds of tokens of unseen decisions and moderation, as in other reasoning models.
O3-Pro
36 output tokens billed against 26 tokens actually received.
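To reproduce the billed-vs-received comparison, here's a minimal sketch that re-tokenizes the visible response text and compares it to the usage report. It assumes the Responses API and the o200k_base encoding (OpenAI hasn't published o3-pro's tokenizer), so the local count is approximate; swapping the model name to o1-pro should show the larger gap reported below.

```python
# Minimal sketch: compare billed output tokens to the tokens you can
# actually see in the response text. o200k_base is an assumption, not
# a documented fact about o3-pro, so treat the local count as rough.
import tiktoken
from openai import OpenAI

client = OpenAI()
enc = tiktoken.get_encoding("o200k_base")  # assumed encoding

resp = client.responses.create(
    model="o3-pro",
    input="Say hello in five words.",
)

received = len(enc.encode(resp.output_text))  # tokens visible to you
billed = resp.usage.output_tokens             # tokens you pay for
print(f"billed={billed} received~={received} overhead~={billed - received}")
```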
The base vision tile also still appears to bill at 85 tokens instead of 75.
O1-Pro
193 more output tokens billed than received:
Note the peculiar vision input billing of o1-pro, also seen in o1. The pricing guide says a 512x512 image should be one tile (75 or 85 tokens). Here, however, a detail:low image always bills at 22 tokens, minimum and maximum, including container overhead, and detail:high, as shown, is 41 tokens including its text. A 512x513 image jumps to 63 tokens, 22 more input tokens. Perhaps a price break because of the stratospheric cost otherwise? At the very least, o1's vision pricing formula is undisclosed and unpublished.
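A rough way to probe this undocumented formula: send the same one-line prompt with synthetic blank images at the sizes discussed above and log the billed input tokens for each case. This is a measurement sketch, not the formula itself; it uses o1 via Chat Completions (o1-pro is Responses-only), and the sizes and detail levels are just the cases above.

```python
# Probe the undocumented vision billing: same prompt, varying image
# size and detail level, recording usage.prompt_tokens for each run.
import base64, io
from openai import OpenAI
from PIL import Image

client = OpenAI()

def data_url(width: int, height: int) -> str:
    """Build a blank PNG of the given size as a base64 data URL."""
    buf = io.BytesIO()
    Image.new("RGB", (width, height)).save(buf, format="PNG")
    return "data:image/png;base64," + base64.b64encode(buf.getvalue()).decode()

for (w, h), detail in [((512, 512), "low"), ((512, 512), "high"),
                       ((512, 513), "high")]:
    resp = client.chat.completions.create(
        model="o1",  # o1 shows the same billing pattern as o1-pro
        messages=[{"role": "user", "content": [
            {"type": "text", "text": "Describe this image in one word."},
            {"type": "image_url",
             "image_url": {"url": data_url(w, h), "detail": detail}},
        ]}],
    )
    print(f"{w}x{h} detail={detail}: prompt_tokens={resp.usage.prompt_tokens}")
```

Subtracting the no-image baseline for the same prompt isolates what the image itself adds, which is how the 22/41/63 figures above fall out.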
Adding images adds around 1-3 seconds of latency across all other models. So with these 15-second response times, there is either a queue, there are unseen moderations or decisions happening before your billed task… or OpenAI has figured out how to ship a model with a 3-token-per-second generation rate.
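That 3-token-per-second figure is just billed output divided by wall-clock time; a quick timing sketch like the one below makes the same inference. If roughly 36 output tokens take roughly 15 seconds, something other than decoding is eating the time.

```python
# Time a tiny request and compute the apparent generation rate from
# billed output tokens. A rate near 3 tok/s on a short answer points
# to queueing or hidden pre-work rather than raw decoding speed.
import time
from openai import OpenAI

client = OpenAI()

t0 = time.monotonic()
resp = client.responses.create(model="o3-pro", input="Say hello.")
elapsed = time.monotonic() - t0

rate = resp.usage.output_tokens / elapsed
print(f"{resp.usage.output_tokens} output tokens in {elapsed:.1f}s "
      f"= {rate:.1f} tok/s")
```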