O3-mini vs 4o for a chatbot

Shintelo · February 4, 2025, 3:09am

Hello!
I just wanted to ask if the o3-mini is currently a better option for a chatbot than the 4o.
I’m asking this because of the big difference in price per token, as the o3-mini is literally less than half the price.

I really appreciate your response since, if the difference is minimal (or if I even hear that the o3-mini is actually better), I will definitely go for the o3-mini because of its price.

P.S.: I’m using an Assistant for the chatbot, and I understand that o3-mini is currently not available for the Assistant.

civilianemail · February 4, 2025, 4:11am

What purpose will your chatbot fulfill? o3-mini should have stronger reasoning skills while 4o should have broader world knowledge. If you only need it to say “Hello” and have conversations of no huge importance, you might even choose the very cheap 4o-mini.

edwinarbus · February 4, 2025, 4:20am

Nope, o3-mini is available for assistants today!

Although, what type of chatbot? To @civilianemail’s point, GPT-4o is best for generating fast text and chatting… o3-mini is great at logic, reasoning, etc.

Shintelo · February 4, 2025, 4:33am

@edwinarbus @civilianemail Oh, great news! However, I think that, as both of you said, I won’t need the o3-mini.

My chatbot is for an enterprise that sells products, so I have a lot of instructions in the “system instruction.” When I tested it with the 4o-mini, it worked, but it didn’t follow some specific orders.

Unrelated to the main question, but do you think I should create an assistant for each task and have another assistant decide which one to use at each moment? Or would a single assistant handling everything be fine?

I’m asking because I didn’t quite understand whether the system instructions count as input tokens, and also because I don’t want the assistant to get overwhelmed with too much text.

the.brainiac · February 4, 2025, 6:51am

On the face value, O3-mini is faster and cheaper than 4o.

However, O3-mini is reasoning model, it means you are paying not only for input and output but for the reasoning tokens as well and if your “reasoning effort” param is set to medium or high, there will be substantial reasoning tokens used.

You said you are building an enterprise level chatbot so I assume there will be rag involved across various company documents, and reasoning across these documents will cost a lot of reasoning tokens.

Due to this, it’s safe to say o3-mini’s price is on par with 4o.

So, if you want to optimize for speed, go for o3-mini.

leonelebi · February 4, 2025, 5:21pm

Is 03 mini ready to be used for assistants? I do not see it in the options.

greendsys · February 4, 2025, 8:47pm

input is 50% cached. so it might be a better if you need reasoning.

Shintelo · February 4, 2025, 9:08pm

Why would i use RAG, if the assistant has all the info in their system instructions, and can also use function calling?

tonikukoc.no7 · February 5, 2025, 5:44am

From OpenAI’s news page (https://openai.com/index/openai-o3-mini/).

o3-mini is rolling out in the Chat Completions API, Assistants API, and Batch API starting today to select developers in API usage tiers 3-5.

santinitoxd27 · February 6, 2025, 6:38pm

Hey, I’ve been trying to use o3 for assistants. I’m tier 3, but it does not appear as an option. How can I access it?

BrianLovesAI · February 11, 2025, 12:23am

1 vote for 4o (tested o3-mini but failed the test, 4o passed)

Topic		Replies	Views
Do you guys use 4o mini or o3 mini for your ai chatbots? API assistants-api	11	1210	March 4, 2025
Successor to 4o mini? When? API	9	648	April 5, 2025
When do you wanna use 4o vs. o1 vs. o3-mini? Community chatgpt , api	3	19664	April 9, 2025
Which model is best for speed and accuracy? API gpt-35-turbo , api , python , gpt-4o	8	22072	February 26, 2025
When do you actually want to use 4o vs. 4o-mini API api	4	8165	January 24, 2025

O3-mini vs 4o for a chatbot

Related topics