GPT-4.1: Inconsistent Instruction Following and Poor Tool Invocation Behavior

We’re experiencing unreliable behavior from the GPT-4.1 model when used via the OpenAI API, specifically related to its function/tool calling behavior. The model frequently fails to follow well-defined instructions or skips invoking tools even when the schema and prompts are correctly structured.