New models are incapable of proper function calling

Yes, if you don’t need the parallel tool calls that newer models can emit (for example, calling four simultaneous instances of “dice(roll:d6)” for a dungeon game), and you won’t be using document retrieval that requires that capability, then gpt-4-0613, trained simply on functions, should work for you.
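To illustrate what that parallel format looks like on the client side, here is a minimal sketch. The `dice` tool schema, the stand-in roll logic, and the assistant message are all hypothetical and hand-built (no API call is made); the point is only the shape of the `tool_calls` list and the per-call `tool` replies you send back.

```python
import json

# Hypothetical schema for the "dice" tool mentioned above.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "dice",
        "description": "Roll a die, e.g. d6",
        "parameters": {
            "type": "object",
            "properties": {
                "roll": {"type": "string", "description": "die spec, e.g. 'd6'"}
            },
            "required": ["roll"],
        },
    },
}]

def dice(roll: str) -> int:
    """Deterministic stand-in for a real roll (returns the max face)."""
    return int(roll.lstrip("d"))

def handle_tool_calls(tool_calls):
    """Dispatch each parallel tool call and build one 'tool' reply per call."""
    replies = []
    for call in tool_calls:
        args = json.loads(call["function"]["arguments"])
        replies.append({
            "role": "tool",
            "tool_call_id": call["id"],  # must echo the id from the assistant message
            "content": str(dice(**args)),
        })
    return replies

# Simulated assistant message: four simultaneous dice(roll:"d6") calls,
# the shape newer models emit under the parallel tool-call format.
tool_calls = [
    {"id": f"call_{i}", "type": "function",
     "function": {"name": "dice", "arguments": '{"roll": "d6"}'}}
    for i in range(4)
]

replies = handle_tool_calls(tool_calls)
```

A legacy `functions`-trained model like gpt-4-0613 would instead emit a single `function_call` per assistant turn, so the loop above would only ever see one entry.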

The greater ability and lower perplexity of the non-turbo gpt-4 model (even at the same temperature) should also carry over to producing the tokens of function arguments.

I faced the same issue. After experimentation, I found that GPT-4-turbo is better than GPT-3.5-turbo at function calling.

If you have budget constraints, then GPT-3.5-turbo-16k is a much better option.

That being said, experiment for your use case.