How does gpt-4-1-mini compare with gpt-4o-mini for tool calling

Plutes · June 12, 2025, 2:43am

How does gpt-4-1-mini compare to gpt-4o-mini for selecting which function to call and setting the selected function’s parameters?

I’ve noticed that gpt-4o-mini may confabulate an argument that does not exist or try to set its value with an improper type. I’ve also noticed that it will often pick the wrong tool when two tools are too similar. I’m curious how the current slate of API models rank according to tool handling.
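To illustrate the two argument failures described above (an argument that is not in the schema, and a value of the wrong type), here is a minimal Python sketch that checks a model's tool-call arguments against the tool's declared parameters. The `get_weather` tool, its parameters, and the `check_tool_call` helper are hypothetical names made up for this example. It only checks top-level argument names and basic JSON types, so treat it as a rough sanity check rather than full JSON Schema validation.

```python
import json

# Hypothetical tool definition in the JSON-Schema style used for function calling.
GET_WEATHER = {
    "name": "get_weather",
    "parameters": {
        "type": "object",
        "properties": {
            "city": {"type": "string"},
            "days": {"type": "integer"},
        },
        "required": ["city"],
    },
}

# Mapping from JSON Schema type names to the Python types json.loads produces.
_PY_TYPES = {
    "string": str,
    "integer": int,
    "number": (int, float),
    "boolean": bool,
    "object": dict,
    "array": list,
}


def check_tool_call(tool, raw_arguments):
    """Return a list of problems found in a model-produced arguments string."""
    try:
        args = json.loads(raw_arguments)
    except json.JSONDecodeError as exc:
        return [f"arguments are not valid JSON: {exc.msg}"]
    if not isinstance(args, dict):
        return ["arguments must be a JSON object"]

    schema = tool["parameters"]
    props = schema.get("properties", {})
    problems = []

    # Required arguments the model left out.
    for name in schema.get("required", []):
        if name not in args:
            problems.append(f"missing required argument: {name}")

    for name, value in args.items():
        # Arguments the model made up that the schema does not declare.
        if name not in props:
            problems.append(f"unknown argument: {name}")
            continue
        expected = props[name].get("type")
        py_type = _PY_TYPES.get(expected)
        if py_type is None:
            continue  # no recognizable type declared, nothing to check
        # bool is a subclass of int in Python, so reject it for numeric types.
        if isinstance(value, bool) and expected in ("integer", "number"):
            ok = False
        else:
            ok = isinstance(value, py_type)
        if not ok:
            problems.append(
                f"wrong type for {name}: expected {expected}, "
                f"got {type(value).__name__}"
            )
    return problems
```

Running the same set of prompts through each model and counting how often `check_tool_call` returns a non-empty list would give a rough per-model error rate for argument setting. Checking tool selection would additionally need a labeled set of expected tool names.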

ptkbhv · July 25, 2025, 7:03am

We found 4.1-mini to be well ahead of 4o-mini for tool calling. You can check the performance on the Galileo Agent Leaderboard v2.

Plutes · July 25, 2025, 5:44pm

Super helpful — thanks.
