Introducing GPT-4o mini in the API

The overall cost per image is the same. Perhaps it is just their way of representing the same cost at the lower per-token price? Which really highlights how much cheaper the text tokens are.

3 Likes

Yes, that could indeed be a hint. Great development!

1 Like

Try with a new API key if it’s still not showing for you.

Thx mate, I looked through my code and I realised I’m white washing specific models, so the issue was in my own code … :confused:

(Embarrassed blush!)

Do you mean “white listing”? That’s rather different!! :laughing:

Btw the modern phrase we are supposed to use is “allow listing” :wink:

1 Like

Will it be possible to fine-tune this model?

So, does GPT-4o also support 16,384 max_tokens?

Hehe, yes :smiley:
I meant white listing :stuck_out_tongue:

Yes, @jamie10 - exact same issue. I already thought it might be a fallback or some additional context that mini needs.
If I do it in chat comparison, upload a picture and ask OpenAI to explain the picture, GPT-4o will use around 800 tokens, while GPT-4o mini will consume 25,000 tokens.
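Running the numbers on those token counts shows why the per-image cost comes out roughly the same. The prices below are the launch-time list prices per 1M input tokens (an assumption — check the pricing page for current figures):

```python
# Rough cost comparison for the image-token counts reported above.
# Per-1M-input-token list prices at launch (assumption):
GPT_4O_PRICE_PER_TOKEN = 5.00 / 1_000_000       # GPT-4o input
GPT_4O_MINI_PRICE_PER_TOKEN = 0.15 / 1_000_000  # GPT-4o mini input

cost_4o = 800 * GPT_4O_PRICE_PER_TOKEN            # ~800 image tokens on GPT-4o
cost_mini = 25_000 * GPT_4O_MINI_PRICE_PER_TOKEN  # ~25,000 on GPT-4o mini

print(f"GPT-4o:      ${cost_4o:.5f}")
print(f"GPT-4o mini: ${cost_mini:.5f}")
```

So mini "spends" roughly 30x the tokens at roughly 1/30th the price, and the dollar cost per image is nearly identical.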

Replying to myself…turns out some of my functions needed some updating and care. I can gladly report that function calling on GPT-4o-mini is phenomenal and I’m so glad to be done with the nightmares of 3.5 this past year.

Thank you so much and looking forward to building more with it!
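For anyone else migrating function calls over from 3.5: the tool schema is unchanged, only the model name differs. A minimal sketch of a request payload (the `get_weather` tool here is a made-up example, not a real API):

```python
import json

# Hypothetical tool definition -- substitute your own function schema.
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city",
            "parameters": {
                "type": "object",
                "properties": {
                    "city": {"type": "string", "description": "City name"},
                },
                "required": ["city"],
            },
        },
    }
]

payload = {
    "model": "gpt-4o-mini",
    "messages": [{"role": "user", "content": "What's the weather in Oslo?"}],
    "tools": tools,
    "tool_choice": "auto",
}

# With the official Python client this would be sent as:
#   client.chat.completions.create(**payload)
print(json.dumps(payload)[:60])
```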

3 Likes

Is fine tuning not available yet or have I coded it wrong?

openai.BadRequestError: Error code: 400 - {'error': {'message': 'Model gpt-4o-mini is not available for fine-tuning or does not exist.', 'type': 'invalid_request_error', 'param': None, 'code': 'model_not_available'}}

Fine-tuning will be available shortly for the 4o-Mini model.
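In the meantime you can get your training data ready — chat-model fine-tuning expects a JSONL file where each line is one example in messages format (the example content below is made up):

```python
import json
import os
import tempfile

# Each line of the training file is one chat example in messages format.
examples = [
    {"messages": [
        {"role": "system", "content": "You are a terse assistant."},
        {"role": "user", "content": "Capital of France?"},
        {"role": "assistant", "content": "Paris."},
    ]},
]

path = os.path.join(tempfile.gettempdir(), "train.jsonl")
with open(path, "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

# Once mini fine-tuning opens up, upload and start a job with the client:
#   file = client.files.create(file=open(path, "rb"), purpose="fine-tune")
#   client.fine_tuning.jobs.create(training_file=file.id, model="gpt-4o-mini")
print(sum(1 for _ in open(path)), "training example(s) written")
```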

4 Likes

Is there any news on how much fine tuned models will cost at inference time?

Other fine-tuned models have generally been 4–6 times as expensive at inference, so I would expect mini to follow that trend.

Although GPT-4o mini theoretically supports a 16k-token output, in my tests it appears to be constrained to around 4k tokens, even to the extent of hallucinating to stay within this limit. When I attempt to push it beyond 4k tokens, the model often resorts to ellipses or claims to have generated content that doesn’t exist.

I could be mistaken in my assessment. Has anyone else tested this feature?
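One way to distinguish "hit the cap" from "chose to stop": check the `finish_reason` on the response. A rough sketch, using plain dicts shaped like the Chat Completions response (on the official Python client these are attributes rather than dict keys):

```python
def was_truncated(choice: dict) -> bool:
    """Return True if the completion was cut off by the token limit.

    A finish_reason of "length" means max_tokens was hit; "stop" means
    the model ended on its own -- so a short "stop" response is the model
    stopping early, not a hard cap.
    """
    return choice.get("finish_reason") == "length"

# Fake response choices for illustration:
hit_limit = {"finish_reason": "length", "message": {"content": "Once upon a"}}
stopped = {"finish_reason": "stop", "message": {"content": "The end."}}

print(was_truncated(hit_limit))  # True
print(was_truncated(stopped))    # False
```

If you're seeing ~4k-token outputs with `finish_reason == "stop"`, that would point to the model stopping early rather than a hidden limit.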

1 Like

I’ve managed to get the model to generate ~10k tokens by telling it that it can do that and forcing it to write a story that “never ends”. I have no idea how usable it would be for more serious use cases though.

3 Likes

How long will it take for fine-tuning to roll out? Is there a way to request access to it now?

Has anyone done any vision response-speed testing between the 4o and 4o-mini APIs?
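I haven't benchmarked this myself, but a simple harness would look something like the sketch below — pass it a closure that makes the actual vision request for each model (the commented-out request and `IMAGE_URL` are placeholders):

```python
import statistics
import time

def time_calls(call, runs=5):
    """Time repeated invocations of `call` and return (mean, stdev) seconds."""
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        call()
        samples.append(time.perf_counter() - start)
    return statistics.mean(samples), statistics.stdev(samples)

# Real usage would wrap one vision request per model, e.g.:
#   lambda: client.chat.completions.create(
#       model="gpt-4o-mini",  # then repeat with "gpt-4o"
#       messages=[{"role": "user", "content": [
#           {"type": "text", "text": "Describe this image."},
#           {"type": "image_url", "image_url": {"url": IMAGE_URL}},
#       ]}])

# Dummy call so the harness runs stand-alone:
mean, stdev = time_calls(lambda: time.sleep(0.01), runs=3)
print(f"{mean:.3f}s ± {stdev:.3f}s")
```

Run each model several times with the same image and compare the means; a single request per model is too noisy to conclude anything.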

Thanks for the update, this is very interesting.
Can you tell us the details of the pricing?

You can find pricing at openai.com/api/pricing