Setting response_format to { type: "json_object" } seems to return only a singular JSON object like {"key1": "value1", "key2": "value2", ...} rather than an array like [{"key1": "value1", "key2": "value2"}, {"key1": "value3", "key2": "value4"}, ...], unless the array is encapsulated as a value within a key-value pair. This persists even when the prompt explicitly requests the output as a JSON array. It can be worked around by parsing the JSON object and extracting the array, but it seemed worth noting for clarification. It would be great if a future version of the API supported direct JSON array responses to save the extra parsing.
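In the meantime, the extraction workaround is only a couple of lines (a sketch; "items" stands in for whatever key your prompt asks the model to wrap the array under, and content is the model's response text):

const parsed = JSON.parse(content); // content: the json_object response string
const items = Array.isArray(parsed.items) ? parsed.items : []; // unwrap the array, fall back to []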
I’m also seeing this issue. I need it to return [{"key1": "value1"}, {"key1": "value1"}, ...], but instead it just returns a single item, not contained within an array, e.g.
{"key1": "value1"}
It also seems to max out the token limit.
Turning off json_object makes it work fine, but then we’re back to the model returning text before and after the object.
The issue happens on both the gpt-3.5 and gpt-4 models.
Same here, I cannot force the model to return an array as the response. It always returns an object with at least one key.
I’m running into a similar problem.
I’ve analysed tens of thousands of items using GPT-4, mapping text to JSON.
I like to analyse multiple items at once to save on prompt input tokens (analysing a list at a time).
However, when I tried to do the same with the new json_mode, it doesn’t like to return an array of JSON objects.
Instead it returns a million whitespace characters, or just the first item in the list.
It works for performing a single analysis, so I guess it’s constrained to return a dictionary format, not a list.
Function calling has never returned arrays at the outer layer, so it’s the same with the new json_mode.
You can wrap it like this:
{"array": [...]}
But I prefer:
{"1": {}, "2": {}, ...}
This tends to be more reliable (prompts outputting arrays can randomly miss elements and have index errors), but it’s a bit more trouble to set it up with n_elements instead of a fixed amount.
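For what it’s worth, turning the numbered-keys object back into an ordered array is straightforward (a sketch, assuming sequential numeric string keys as above):

const obj = JSON.parse(content);
const items = Object.keys(obj)
  .sort((a, b) => Number(a) - Number(b)) // numeric sort, so "10" comes after "2"
  .map((key) => obj[key]);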
From my experience, even if I force the model to return a JSON array via prompts, it’s very unstable and the array can easily get messed up.
Like @msp26 mentioned, an object with string indices is far more stable and usable in production.
I would definitely love to see OpenAI resolve this.
I have an app that needs to return an array of objects, but as pointed out, response_format: {type: "json_object"} only returns the first object of the array. It’s making my app a little tricky to deal with, since I sometimes have to handle random text before the array.
Hi and welcome to the Developer Forum!
You just need to strip the Markdown code fences from the response:
response.content = response.content.replace(/```json\n?|```/g, '');
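After stripping, the content should parse cleanly:

const data = JSON.parse(response.content); // throws a SyntaxError if the JSON is still malformed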
I found a solution!
If you want to return a JSON array, you have to make sure the top-level item of the JSON response is an object.
For example here is my system prompt:
Provide JSON format as follows, along with the definition of each field:
{
"offers": [
{
"description": "...", # This is the product name, e.g. Coronita.
"age": "...", # Age of the spirit or wine, e.g. 7
"edition": "...",
"vintage": "...",
"release_year": "..."
}
]
}
By putting the array under the offers key at the top level of the object, my responses are coming back as an array every time!
Here is my chat request with the response_format set:
const completion = await openai.chat.completions.create({
  messages: message.messages,
  model: OPENAI_MODEL,
  response_format: { type: "json_object" },
});
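From there the array is one parse away, using the standard chat completions response shape (offers being the top-level key from the prompt above):

const { offers } = JSON.parse(completion.choices[0].message.content);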
Look, I have something that works
{
"response_format": {"type": "json_object"},
"messages": [
{
"role": "system",
"content": "Arrange the rooms from most pleasant to least pleasant to work in. Always provide your result in JSON format."
},
{
"role": "user",
"content": "I'm looking for the best place to work, I have a choice between 4 rooms:\n - Room Studio: {\"humidity\":38.54,\"temperature\":26.43,\"pressure\":998.67}, Capacity: 6- Room Cockpit: {\"humidity\":40.86,\"temperature\":26.37,\"pressure\":1000.09}, Capacity: 16- Room Loft: {\"humidity\":34.45,\"temperature\":28.23,\"pressure\":1001.61}, Capacity: 8- Room Hall: {\"humidity\":44.3,\"temperature\":24.92,\"pressure\":999.69}, Capacity: 40. Calculate the wellness value and provide a justification."
}
],
"functions": [
{
"name": "calculate_wellness_value",
"description": "Calculate the wellness value and provide a justification",
"parameters": {
"type": "object",
"properties": {
"rooms": {
"type": "array",
"items": {
"type": "object",
"properties": {
"Name": {
"type": "string",
"description": "The name of the room"
},
"temperature": {
"type": "number",
"description": "The room's temperature in degrees Celsius"
},
"wellnessvalue": {
"type": "string",
"description": "The well-being score calculated for this room from 0 to 100"
},
"justification": {
"type": "string",
"description": "The justification of the well-being score calculation in one sentence"
}
},
"required": ["name", "temperature", "wellnessvalue", "justification"]
}
}
},
"required": ["rooms"]
}
}
],
"function_call": "auto",
"temperature": 0.7
}
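For completeness, when the model goes down the functions route the arguments come back as a JSON-encoded string on the message, so the rooms array is read roughly like this (assuming the model actually called calculate_wellness_value; function_call is absent otherwise):

const message = response.choices[0].message;
if (message.function_call) {
  const args = JSON.parse(message.function_call.arguments); // arguments is a JSON string, not an object
  const rooms = args.rooms; // the array declared in the schema above
}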
Amazing
I’m surprised this hasn’t been accepted as the solution.
U genius
This is the actual solution, and it makes total sense why it works. FYI, GPT-4 outputted arrays just fine (fabricating the name of the outer element, but otherwise working fine), and this made it work for 3.5 as well.
#geniuuus
I also added a JSON schema to my prompt and it has been 100% reliable ever since.
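In case it’s useful, here is roughly what that looks like (the schema and the items key are illustrative, not from any library):

const systemPrompt = "Reply with JSON matching this schema:\n" + JSON.stringify({
  type: "object",
  properties: { items: { type: "array", items: { type: "object" } } },
  required: ["items"],
});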