Any issues with the gpt-4o model recently related to tool calls?

I've noticed some inconsistency in tool calls recently:

{
    "type": "function",
    "function": {
        "name": "verify_user",
        "description": "requires user to input their account number",
        "parameters": {
            "type": "object",
            "properties": {
                "account_number": {
                    "description": "the user's account number",
                    "type": "string"
                }
            },
            "required": ["account_number"]
        }
    }
}
Until yesterday, the model always asked the user for their account number. Now it assumes it already knows the account number and fills in a default, either:

function args {'account_number': 'xxxxxx'}
or
function args {'account_number': '123456'}

I tried gpt-4-turbo to test, and it seems to be working fine.

Is this function call in the Chat Completions or Assistants API? If you're using Chat Completions, use this as the system instruction:

Don’t make assumptions about what values to plug into functions. Ask for clarification if a user request is ambiguous.

If you are using the Assistants API, put the above in the `additional_instructions` parameter of your run.
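For reference, here is a minimal Chat Completions sketch with that instruction in place. This assumes the openai Python SDK v1.x; the model name and user message are placeholders:

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

tools = [{
    "type": "function",
    "function": {
        "name": "verify_user",
        "description": "requires user to input their account number",
        "parameters": {
            "type": "object",
            "properties": {
                "account_number": {
                    "description": "the user's account number",
                    "type": "string"
                }
            },
            "required": ["account_number"]
        }
    }
}]

response = client.chat.completions.create(
    model="gpt-4o",  # placeholder model name
    messages=[
        # The system instruction telling the model not to invent arguments
        {"role": "system", "content": (
            "Don't make assumptions about what values to plug into functions. "
            "Ask for clarification if a user request is ambiguous."
        )},
        {"role": "user", "content": "I want to check my balance."},
    ],
    tools=tools,
)

# If the model behaves, tool_calls is None here and the assistant's text
# asks for the account number instead of guessing one.
print(response.choices[0].message.tool_calls)
print(response.choices[0].message.content)

With the Assistants API, the equivalent is passing the same text as `additional_instructions` when you create the run.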

Let me know if this doesn’t fix your issue. We can try more things.

Yes, definitely, especially gpt-4o-mini with Structured Outputs in the Assistants API. Increasing degradation over the past week; 4o seems more stable.

I have the same issue, I assume. When I started feeding some data to the chat model, it understood it, but it ran the tool several times and then stopped after hitting the agent execution limit. It fetches fine from the chat itself. I am using LangChain. When I changed the model from 4o to turbo, everything worked. gpt-4o still handles prompts fine and produces more or less the desired output via prompts, but it fails with functions. I think it is a bug on OpenAI's side: they claim gpt-4o is the latest and most stable model, but in reality turbo works better and is far more stable. I have seen this issue before from time to time, but since 30.08 it has been consistent, and it even forced me to rewrite my code today, two hours before a presentation. Is there any intentional degradation of the model?
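For anyone who wants to reproduce the swap, this is roughly the change on my side. A sketch assuming langchain-openai's ChatOpenAI, reusing the verify_user tool definition from the first post:

from langchain_openai import ChatOpenAI

verify_user_tool = {
    "type": "function",
    "function": {
        "name": "verify_user",
        "description": "requires user to input their account number",
        "parameters": {
            "type": "object",
            "properties": {
                "account_number": {
                    "description": "the user's account number",
                    "type": "string"
                }
            },
            "required": ["account_number"]
        }
    }
}

# Swapping the model string was the only change:
# llm = ChatOpenAI(model="gpt-4o")      # looped on the tool until the agent's iteration limit
llm = ChatOpenAI(model="gpt-4-turbo")   # calls the tool once, or asks for the missing argument

result = llm.bind_tools([verify_user_tool]).invoke("Please check my account")
print(result.tool_calls)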


Update: it started working with the August version of gpt-4o (gpt-4o-2024-08-06). It would be nice for OpenAI to notify users about changes to the current gpt-4o version.

I'm using Chat Completions, and yes, my prompt has always included that instruction; basically it was working 100% until yesterday, with no code changes.


I was looking at this too. I saw the structure changed and updated to the new structure, but the issue is still the same unless I change to Turbo; then it works, although other issues exist in Turbo too, e.g. the function to hand over to a human suddenly doesn't get called.

Update: I switched to the gpt-4o-2024-08-06 model and it seems stable after several tests. Hopefully OpenAI fixes this issue in the current model.
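For anyone landing here later, pinning the dated snapshot instead of the floating alias is a one-line change. A minimal Chat Completions sketch (the user message is a placeholder):

from openai import OpenAI

client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o-2024-08-06",  # pinned snapshot; the "gpt-4o" alias floats to whatever is current
    messages=[{"role": "user", "content": "I want to check my balance."}],
)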
