GPT-4o with function calling not reliable from Assistants-API

Braincorp · August 1, 2025, 9:41am

Hey there,

We created a SaaS application that leverages OpenAI GPT-4o to open tickets on carriers’ websites for 3PL companies.

Recently, the model started promising instead of acting, whereas it had been working without issue for almost a year. (“We will contact …” instead of Function Calling + “We have contacted …”)

We have lots of instructions, and we repeat the part where we ask the model to perform tool calling before responding to customers. But the rate at which it decides not to follow the instructions has increased, making it hard to use.

The prompt is made with XML tags, in the form of a set of rules and steps to follow.

Any idea on how we can reduce the rate at which it decides not to use function calling?

Cheers,
Kevin

_j · August 1, 2025, 4:45pm

OpenAI will deny that they alter the models. Yet production applications continue to break.

Assistants is a front-end to AI inference, so has multiple API specifications that can be haphazardly broken also.

OpenAI also now manipulate the model with injections of their own system messages, which in in the case of vision input triggering this, can be downright degrading of the quality.

The assistants API endpoint has a decided lack of parameters. The only direct mechanism to tune the probability is on chat completions, not using sampling parameters, and using a logit_bias against a discovered internal token number.

It sounds like it is working, but just not smart. You can talk directly to the AI and tell it precisely to send to a named function tool with the described output, and see that it is functional with a direct request.

The approach I would take is to review the function language itself, and ensure the purposes and reasoning for emitting to the function are present directly in the main description field. The style: “You immediately send to this function tool in response to any user input that implies a need for tracking their issues. etc. You cannot reply to the user in solving logistics problems without having sent to this tool first in your conversation and receiving its response with transient id.”

Topic		Replies	Views
Issue with Assistant API (GPT-4.1) not consistently calling functions Bugs assistants-api	6	532	August 4, 2025
Issue: Function calls work in GPT-4.0 but not in GPT-4.1 / GPT-4.1-2025-04-24 Prompting gpt-4 , gpt-41	1	130	December 15, 2025
OpenAI GPT Functions Not Calling – Debugging Help Needed Bugs function-calling	3	524	February 7, 2025
Model don't follow instructions with Function Calling API gpt-35-turbo , functions , function-calling	3	2379	August 24, 2023
GPT-4o cannot properly call custom functions more than half the time Bugs gpt-4o	21	6803	April 9, 2025

GPT-4o with function calling not reliable from Assistants-API

Related topics