Introducing GPT-4o mini in the API

We just officially launched GPT-4o mini—our new affordable and intelligent small model that’s significantly smarter, cheaper, and just as fast as GPT-3.5 Turbo.

Some details on the new model:

  • Intelligence: GPT-4o mini outperforms GPT-3.5 Turbo in textual intelligence—scoring 82% on MMLU compared to 69.8%—and multimodal reasoning.
  • Price: GPT-4o mini is more than 60% cheaper than GPT-3.5 Turbo, priced at 15¢ per 1M input tokens and 60¢ per 1M output tokens (roughly the equivalent of 2500 pages in a standard book).
  • Modalities: GPT-4o mini currently supports text and vision capabilities, and we plan to add support for audio and video inputs and outputs in the future.
  • Languages: GPT-4o mini has improved multilingual understanding over GPT-3.5 Turbo across a wide range of non-English languages.

Like GPT-4o, GPT-4o mini has a 128k context window and a knowledge cut-off date of October 2023. It also supports up to 16,384 max_tokens.

We plan to launch fine-tuning for GPT-4o mini in the coming days. You can learn more about GPT-4o mini in our announcement blog and API documentation, or by testing the model in Playground. Excited to hear what you think!

39 Likes

Great news! Keep up the good work!

6 Likes

That sounds like a big jump compared to GPT 3.5 Turbo!

More context, more possibilities at a much lower price: Congrats to the team!

Looking forward to test it!

3 Likes

Super excited about this…BUT first initial tests: function calling is not as accurate as GPT-3.5 Turbo (which already was problematic), doesn’t seem to be following my schema set in tools as much as GPT-4o does.

I assume this was tested and deemed great, which makes me think perhaps there is a better way to write my tooling? Concrete, best practices for this newer model?? (which I would LOVE to work for me btw…)

3 Likes

I hope that this is the start of the near end for our GOAT GPT-4

Not an AI Expert guys but please fix the issues with GPT-4o the Context and the clarity points can be taken from Gemini-1.5-pro that model with 2M Token is a beast.

But I would still like to use GPT-4 over anything. However, how long before it out performs OPENAI in all use case is just a matter of time really.

We need to catch up

God Speed @jeffsharris and Team OPENAI

3 Likes

Congratulations on the launch @jeffsharris

This is great news!

I really like the pricing, multi-modal nature, and the response times that I’m seeing.

AI going :chart_with_upwards_trend:, token costs going :chart_with_downwards_trend:.

4 Likes

is it safe to say that 3.5 is going away and being phased out? I’m asking because one of my vendors has my plan mapped to 3.5, and I want an upgrade to 4o mini.

1 Like

Trying to distract us from banging our tables for GPT-5 :wink:

Just kidding.

This is a fantastic offering at this price, further expanding access to good language processing, excellent!!

2 Likes

It took me literally about 5 seconds to switch out the models, so it shouldn’t be a problem switching and reaping the benefits.

2 Likes

Maximum output token limit for gpt-4o-mini?

3 Likes

Amazing! @jeffsharris are there any plans to make gpt-4o-mini available in the finetuning API?

Cheers


Edit: ah whoops looks like your announcement said as much. Can’t wait!

In the blog post it mentions coming in a few days. :slight_smile:

Welcome to the community!

4 Likes

Thanks PaulBellow! I am such a fan of the OpenAI finetuning API. Much easier interface than some other API providers we could mention … :sleeping_bed: :rock:

1 Like

I’d like a “mini me”

Just less Dr. Evil…

1 Like

Ive noticed what i find is an issue, especially with using images in 4o mini , if i upload an image in normal 4o and ask a question, it uses qpprox 1000 input tokens and 200 output tokens, if i run the same task in 4o mini the total number of tokens is nearly 40000. Maybe its just me but i think something seens to be wrong. Anyone else have this issue?

2 Likes

I am currently leveraging GPT-4o mini in a Google Apps Script to enhance functionality within a spreadsheet. The script is used to summarize, correct grammar, improve spelling, translate text, and refine descriptions. Additionally, it provides detailed event and time information related to development activities and data analysis directly within the spreadsheet. By integrating GPT-4o mini, I have significantly improved the accuracy and efficiency of these tasks while reducing costs by over 60% compared to GPT-3.5 Turbo. This integration not only enhances productivity but also enriches the user experience, making complex data processing seamless and effective. I am grateful for the advanced capabilities of GPT-4o mini, which have been instrumental in achieving these improvements. Thank you to the OpenAI team for this powerful tool!

2 Likes

Can fine tuning please be available to everyone like 3.5 16k? Many of us never got GPT 4 fine tuning and it stopped all of my development of long form sample fine tuning.

2 Likes

Hey, good to see you!

Would love an update on your fine-tuning for fiction adventures if you ever have time.

Hope you’re well!

1 Like

The problem became everyone was stuck on 3.5 16k… we kept waiting for GPT 4 to open up… but it’s still “request” only. Not even turbo.

I’m hoping with this lower cost model they will again open up fine tuning to everyone…

We have new software we are debuting this summer that makes writing with AI seamless in-line. More like how Notion AI works, but allows the author to pick any model, including NSFW models.

If mini 4o can be fine tune open for everyone, most of us will use that to fine tune and make outlines and components. :slight_smile:

4 Likes

Is the API for assistants the same schema? I am getting internal server error, I figure this might blow something up or it is down.