GPT-4o Audio + Tool calls = API Bug

clad3815 · January 5, 2025, 11:03am

Actually if GPT include an audio with it’s tools_call the API will throw an error when sending the tool result.

The problem is when we have an assistant message like that (Tools call + Audio response)

    {
      "role": "assistant",
      "tool_calls": [
        {
          "id": "call_oHHuf6tqsySDjE6KIRSK1TvR",
          "type": "function",
          "function": {
            "name": "create_poll",
            "arguments": "{\"question\":\"Pain au chocolat ou beurre ?\",\"choices\":[\"Pain au chocolat\",\"Pain au beurre\"],\"duration\":300}"
          }
        }
      ],
      "audio": {
        "id": "audio_677a61d5d9d08190b1987ff1c1326c73"
      }
    },

The API will just ignore tool_ calls and will throw an error as we don’t provide it:

Invalid parameter: messages with role 'tool' must be a response to a preceeding message with 'tool_calls'.

The bug is on the playground also:

giorgiosilvi · January 10, 2025, 5:14pm

Found the same issue. only workaround for the moment is to literally exclude the audio from the saved message if tool_calls are present, but is obviously suboptimal

Topic		Replies	Views
Issues with gpt-4o-audio-preview when using tools/functions API tools , audio	1	344	November 12, 2024
GPT4 sending empty tool calls API bug , function-calling , assistants-api	1	620	April 2, 2024
Gpt-4o-audio-preview: including tools with async causes 500 response API	4	365	October 28, 2024
Strange JSON payload appended instead of tool being called Bugs	0	209	May 22, 2024
Bug: response.create returns audio only with a single response.input Bugs api , realtime	0	32	January 19, 2025

GPT-4o Audio + Tool calls = API Bug

Related topics