Introducing GPT-4o mini in the API

Looks like it’s down (both gpt-4o and gpt-4o-mini).

2 Likes

I don’t think so; at least gpt-4o-mini is working like a charm. I’ve just tried it with my RAG application.

From the announcement article:

Today, GPT-4o mini supports text and vision in the API, with support for text, image, video and audio inputs and outputs coming in the future.

The wording confuses me a bit. Does this mean text, image, video, and audio I/O is currently supported, or is it still in development? I assume it’s the latter but want to confirm.

Excited to try out this release in our systems!

16383 in the playground.

1 Like

Only text and vision are currently available.
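
If it helps, here’s a minimal sketch of a text + image request with the Python SDK (the prompt and image URL are placeholders):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Text + vision in a single user message: a text part plus an image URL part.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image in one sentence."},
                {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```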

1 Like

Gotcha. Thank you for confirming!

Is the pricing page accurate in saying the vision pricing is the same between 4o and 4o-mini?

1 Like

When I got the email notification about the new model, the name made me wonder: why “mini”? Reading through the various info on the site, I never thought there was a small-model category. What does “smaller tasks” mean? Anyway, the name reminds me of a car ad in Japan, which I’ll rephrase here for GPT-4o mini: 小さなモデル、大きな能力!GPT-4o ミニ。(“Small model, big capability! GPT-4o mini.”)

2 Likes

However, according to the playground tooltip, that is “The maximum number of tokens to generate shared between the prompt and completion. The exact limit varies by model. (One token is roughly 4 characters for standard English text)”

So what is the maximum number of output tokens for the API? For gpt-4o it is still, as far as I know, only 4K tokens.

This is incorrect.

The maximum it can generate is reported to be 16,384 in the OP and I am seeing max_tokens of up to 16,383 in the playground.

It looks like they didn’t update the max_tokens tooltip in the playground when they changed the parameter’s meaning.
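
If you want to verify it yourself, here’s a quick sketch (max_tokens here caps the completion only, not prompt + completion):

```python
from openai import OpenAI

client = OpenAI()

# Request the largest completion the model will allow;
# 16,384 is the output cap reported for gpt-4o-mini.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Write the longest story you can."}],
    max_tokens=16384,
)
print(response.usage.completion_tokens)
```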

2 Likes

When using gpt-4o-mini vision capabilities, I see a 25x increase in input tokens compared to gpt-4o for the same picture. Is this a bug or intended behavior?

1 Like

Intended. The cost of reading an image is the same; the text cost is different.
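
Rough back-of-envelope sketch, using the per-image token counts and launch prices from the pricing page (treat the exact figures as assumptions and double-check them there):

```python
# Illustrative figures only (taken from the pricing page at launch; verify before relying on them).
GPT4O_INPUT_USD_PER_M = 5.00        # $ per 1M input tokens
GPT4O_MINI_INPUT_USD_PER_M = 0.15   # $ per 1M input tokens

# Listed per-image token counts: base + per 512px tile, for a 2-tile image.
gpt4o_image_tokens = 85 + 2 * 170          # = 425
gpt4o_mini_image_tokens = 2833 + 2 * 5667  # = 14,167 (~33x more)

print(gpt4o_image_tokens * GPT4O_INPUT_USD_PER_M / 1e6)            # ≈ $0.0021
print(gpt4o_mini_image_tokens * GPT4O_MINI_INPUT_USD_PER_M / 1e6)  # ≈ $0.0021
```

The mini token count is ~33x higher, but the per-token price is ~33x lower, so the dollar cost of the image itself works out the same; only the text portion gets cheaper.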

2 Likes

It would seem the cost of processing an image is the same, so to make up the value in tokens, the base token count is increased; see the breakdown here:
https://openai.com/api/pricing/

Is the token count artificially inflated so as not to cannibalize gpt-4o vision sales, or is there a technical necessity for it? In practice gpt-4o-mini is still ~40% cheaper for me but performs worse, so I will probably not switch for now.

1 Like

In order to truly advance AI and human society, GPT should focus on mobile apps, as mobile phones are the most frequently used devices worldwide. I hope that GPT-4o mini can be upgraded across the board, especially in the refinement of its language architecture and the expression of its reasoning; otherwise it will be replaced soon.

Your comment prompted me to check my usage… whoah!

Nice. Thank you. We will be deploying tomorrow. Love your work and grateful for the powerful tool.

Has the Python library for the OpenAI API been updated?
gpt-4o-mini seems to be missing from the assistants.py file in version 1.35.15: openai-python/src/openai/resources/beta/assistants.py
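
As far as I can tell, the Literal list in assistants.py is only a type hint; the model parameter also accepts a plain string, so something like this should already work (a sketch, not verified against 1.35.15):

```python
from openai import OpenAI

client = OpenAI()

# Pass the model name as a plain string; the API does not require the SDK's
# type hints to know about gpt-4o-mini.
assistant = client.beta.assistants.create(
    name="Mini test",
    instructions="You are a helpful assistant.",
    model="gpt-4o-mini",
)
print(assistant.id)
```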

Why am I not finding this model when I retrieve the list of all available models using the API? Does anybody know…?
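
For what it’s worth, this is how I check (model availability can differ per account and organization, so your list may vary):

```python
from openai import OpenAI

client = OpenAI()

# List every model visible to this API key and filter for the mini variants.
model_ids = [m.id for m in client.models.list()]
print([m for m in model_ids if "gpt-4o-mini" in m])
```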

I’ve been testing gpt-4o-mini with our image recognition app. It’s working well so far, but what I don’t understand is the number of tokens.
A typical request requires 1,757 tokens with gpt-4o. The same request with mini is 18,236 tokens.
The price is reduced… but why are there so many tokens?
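
For reference, this is roughly how I’m comparing the two (a hypothetical helper; the prompt and image URL are placeholders):

```python
from openai import OpenAI

client = OpenAI()

# Hypothetical helper: send the same image request to each model and
# report how many prompt tokens it was billed for.
def prompt_tokens(model: str) -> int:
    response = client.chat.completions.create(
        model=model,
        messages=[
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": "What is in this image?"},
                    {"type": "image_url", "image_url": {"url": "https://example.com/photo.jpg"}},
                ],
            }
        ],
    )
    return response.usage.prompt_tokens

print("gpt-4o:", prompt_tokens("gpt-4o"))
print("gpt-4o-mini:", prompt_tokens("gpt-4o-mini"))
```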

1 Like