Classifying Qs based on text. Which model is best? 4o or o3-mini?

jessica.cossgrove · February 25, 2025, 12:53pm

We are providing ChatGPT a text and a set of Qs which it needs to classify to ‘Q based on text’ or ‘Q out of text’. Noticed that 4o and o3-mini provide different results. Which model would you guys recommend for this Q classification task?

puffin · February 26, 2025, 12:18am

i’d run each 100/500/1000x times and grade accuracy on control data. i presume o3-mini might be slightly better, but cost an order of magnitude more.

Scarletioshub · February 26, 2025, 7:15am

GPT-4o generally has better reasoning and contextual understanding, making it more reliable for classification tasks. However, if speed and cost are priorities, GPT-3.5 (o3-mini) might be sufficient. Testing both on your dataset is the best way to determine which aligns with your needs.

Topic		Replies	Views
When do you wanna use 4o vs. o1 vs. o3-mini? Community chatgpt , api	3	19784	April 9, 2025
For synthetic data generation, does o3-mini, o1, or 4o generally fare better? API	2	764	February 7, 2025
Comparing GPT-4o and O3-Mini on same task Prompting chatgpt , gpt-4o , o3-mini	2	1541	March 14, 2025
When do you actually want to use 4o vs. 4o-mini API api	4	8487	January 24, 2025
Which model is best for speed and accuracy? API gpt-35-turbo , api , python , gpt-4o	8	25532	February 26, 2025

Classifying Qs based on text. Which model is best? 4o or o3-mini?

Related topics