A while ago I noticed that my gpt-4o application was no longer calling my tools consistently. I have now reverted to gpt-4o-2024-05-13 and it is working fine and reliably.
I pass 2-3 tools per call. The old model used them as expected; the new one does not call them reliably. I am not saying “randomly”, because it is not really random: for some prompt/tool combinations it calls the tool(s) reliably, for others it never calls them.
So far I have not been able to find any systematic difference between the working and non-working prompt/tool combinations. The general structure of the prompts and of the API call is consistent across all of them. The new model is probably sensitive to something in the particular tool parameters, but I have not been able to run specific tests to isolate it. With the spring snapshot, everything is fine.
I also added the tool_choice="auto" argument to the calls for testing, but it did not make any difference (it is the default anyway).
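For reference, here is a minimal sketch of the call structure I am using (the tool name and parameters below are placeholders, not my actual tools):

```python
from openai import OpenAI

client = OpenAI()

# Placeholder tool definition -- my real tools follow this same structure
tools = [
    {
        "type": "function",
        "function": {
            "name": "get_order_status",
            "description": "Look up the status of a customer order by its ID.",
            "parameters": {
                "type": "object",
                "properties": {
                    "order_id": {
                        "type": "string",
                        "description": "The order ID to look up.",
                    }
                },
                "required": ["order_id"],
            },
        },
    }
]

response = client.chat.completions.create(
    model="gpt-4o",  # pinning "gpt-4o-2024-05-13" instead makes tool calls reliable again
    messages=[{"role": "user", "content": "What is the status of order 12345?"}],
    tools=tools,
    tool_choice="auto",  # explicit for testing, though "auto" is already the default
)

# With the old snapshot this reliably contains a tool call; with the new one it often doesn't
print(response.choices[0].message.tool_calls)
```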
Has anyone had a similar experience?