Hey everyone! I am trying to fine-tune a model where the prompt will be a string and the completion will be stringified JSON. I tried creating this model, but the API response is always a plain string rather than JSON. I am expecting it to return stringified JSON with a structure similar to what I provided in the completion.
What should I do to get the prompt and completion right, and which model is best for this?
P.S.: The JSON I am passing in the completion can get complicated across different use cases.
The model has trouble with JSON.
I’ve found more success using Markdown where possible, or, failing that, YAML.
Markdown with a template as prompt data, and then asking the model to fill it in, works pretty well!
I can then parse the markdown and turn it back into JSON in my API caller.
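To illustrate the idea, here is a minimal sketch of parsing a filled-in Markdown template back into JSON. The template shape (level-2 headings as keys, the text below each heading as the value) and the field names are my own assumptions for the example, not from the original post:

```javascript
// Hypothetical sketch: turn a filled-in Markdown template back into an object.
// Assumes each "## heading" is a key and the lines beneath it are the value.
function markdownToJson(md) {
  const result = {};
  let currentKey = null;
  for (const line of md.split("\n")) {
    const heading = line.match(/^##\s+(.+)$/);
    if (heading) {
      currentKey = heading[1].trim();
      result[currentKey] = "";
    } else if (currentKey && line.trim()) {
      // Append continuation lines to the current field.
      result[currentKey] = result[currentKey]
        ? result[currentKey] + "\n" + line.trim()
        : line.trim();
    }
  }
  return result;
}

const reply = "## name\nAlice\n## age\n30";
console.log(JSON.stringify(markdownToJson(reply)));
// → {"name":"Alice","age":"30"}
```

The appeal of this approach is that a malformed heading degrades gracefully (you get a missing field) instead of a `JSON.parse` exception taking down the whole response.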
Thanks for the response. But in my case, I have a JSON object with multiple levels:

key1: {
  key2: {
    key3: [jajsnasas],
    key4: ...
  }
}

Something like this. Is there any way I can fine-tune my model so that it returns these deeply nested objects as well?
In almost every case I see where a fine-tuned model was created, it wasn’t necessary. Can you share the reason you chose this approach instead of the many other pathways to generate JSON?
Sorry, I don’t know any of the other ways. Can you share some material where I can find them?
Okay, no. There are many pathways to AI success, but no single pathway should be suggested without at least a general idea of the objective. Share your goal and I’ll be happy to provide some ideas.
In case it helps,
For several months I’ve been getting solid JSON responses back from the gpt-4 model by providing a format in the prompt and also having the system message say:
“All of your responses should be in UTF8, following the provided JSON format in each request, with no extra commentary.”
Yesterday, I started hitting the gpt-4-1106-preview model and suddenly about half the responses came back with triple-backtick quote wrappers. I tried amending the system message:
“All of your responses should be in UTF8, following the provided JSON format in each request, with no extra commentary or quote wrappers.”
But it didn’t fix the problem.
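As a client-side stopgap (my own hedged sketch, not something from the thread), you can strip any fence wrappers off the reply before parsing, which tolerates both wrapped and unwrapped responses:

```javascript
// Sketch: remove an optional ```json … ``` wrapper around a model reply.
// If no fences are present, the text is returned unchanged (trimmed).
function stripFences(text) {
  const m = text.trim().match(/^```(?:json)?\s*\n?([\s\S]*?)\n?```$/);
  return m ? m[1].trim() : text.trim();
}

console.log(stripFences('```json\n{"a": 1}\n```')); // → {"a": 1}
console.log(stripFences('{"a": 1}'));               // → {"a": 1}
```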
Fortunately, with this model, they have added a parameter for requesting JSON output. You still absolutely need to have a system message like the one above, or it will generate tons of whitespace until context is exhausted. But now you can also specify the response format on the API request like this:
const request = {
  model: "gpt-4-1106-preview",
  response_format: { "type": "json_object" },
  messages: [
    { "role": "system", "content": systemMessage },
    { "role": "user", "content": prompt },
  ],
};
Now it’s back to its old self, reliably returning consumable JSON responses.
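To round that out, here is a hedged usage sketch of sending that request and parsing the reply. The `askForJson` helper and its error handling are my own additions; note that even in JSON mode the output can be truncated if it hits `max_tokens`, so `JSON.parse` is still wrapped in a try/catch:

```javascript
// Parse a completion string, returning null instead of throwing on
// truncated or malformed output.
function parseCompletion(raw) {
  try {
    return JSON.parse(raw);
  } catch (err) {
    return null;
  }
}

// Hypothetical helper: POST the request object to the Chat Completions
// endpoint and return the parsed JSON payload.
async function askForJson(request, apiKey) {
  const res = await fetch("https://api.openai.com/v1/chat/completions", {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify(request),
  });
  const data = await res.json();
  return parseCompletion(data.choices[0].message.content);
}
```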