Strange unicode characters in response of ResponseAPI

_j · June 3, 2025, 8:01pm

This was seen and just commented on in another thread.

While the gpt-4.1 model is symptomatic, I propose tackling it head-on, with the name itself indicating the structured output response’s purpose and type of character set.

{
  "name": "api_utf_8_json",
  "schema": {
    "type": "object",
    "properties": {
      "products": {
        "type": "array",
        "description": "A list of products.",
        "items": {
          "type": "object",
          "properties": {
            "Eans": {
              "type": "array",
              "description": "List of EANs for the product.",
              "items": {
                "type": "string"
              }
            },
            "Name": {
              "type": "string",
              "description": "The name of the product."
            },
            "LabelText": {
              "type": "string",
              "description": "Label text providing details about the product."
            },
            "UsedUrls": {
              "type": "array",
              "description": "List of URLs associated with the product.",
              "items": {
                "type": "string"
              }
            },
            "ProductId": {
              "type": "string",
              "description": "Unique identifier for the product."
            },
            "ProductName": {
              "type": "string",
              "description": "The product's name."
            },
            "LongDescription": {
              "type": "string",
              "description": "A detailed description of the product."
            },
            "ShortDescription": {
              "type": "string",
              "description": "A brief description of the product."
            }
          },
          "required": [
            "Eans",
            "Name",
            "LabelText",
            "UsedUrls",
            "ProductId",
            "ProductName",
            "LongDescription",
            "ShortDescription"
          ],
          "additionalProperties": false
        }
      }
    },
    "required": [
      "products"
    ],
    "additionalProperties": false
  },
  "strict": true
}

Topic		Replies	Views
GPT-4o returning malformed Unicode like \u0000e6 instead of æ — encoding bug? Bugs	1	389	July 25, 2025
Wrong encoding for gpt-4o during API Chat completion Bugs	2	1578	May 15, 2024
Model returning malformed characters in JSON response using API Bugs	6	426	November 21, 2025
Support of unicode in gpt4-1106-preview Bugs gpt-4 , api	10	2493	November 15, 2024
Gpt-4-1106-preview is not generating utf-8 API gpt-4-turbo	8	9242	February 17, 2024

Strange unicode characters in response of ResponseAPI

Related topics