Hi there, I have been using the new structured outputs feature and it has been working great for a few of my use cases. I am now hoping to use gpt-4o-2024-08-06 responses to fine-tune gpt-4o-mini-2024-07-18 for my use case.
I am planning to do this in a maintainable way: a script that takes a .txt file of prompts along with a Pydantic response model, automatically gets the gpt-4o-2024-08-06 responses via the Batch API, and saves them as fine-tuning data for a gpt-4o-mini model. I feel like this will be really useful for any case where gpt-4o-mini can't quite cut it on its own without fine-tuning, but gpt-4o-2024-08-06 is overkill. Basically it allows fine-tuning without having to take the time to manually assemble prompt/response pairs. I'm planning to share the repo for this once I have it in a good place; a rough sketch of the batch step is below.
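For context, here is roughly what I have in mind for building the Batch API input file. This is a minimal sketch, not the finished script: the file names, the one-prompt-per-line .txt format, the example `Sentiment` model, and the schema-tightening step are all my own assumptions (strict mode has additional schema requirements beyond what `model_json_schema()` emits, e.g. `additionalProperties: false`).

```python
# Sketch: build a Batch API input file from a .txt of prompts (one per line)
# plus a Pydantic response model. File names and the example model are placeholders.
import json
from pydantic import BaseModel, Field

class Sentiment(BaseModel):
    label: str = Field(description="One of: positive, negative, neutral")
    confidence: float = Field(description="Score between 0 and 1")

# Convert the Pydantic model to a JSON schema and tighten it for strict mode.
schema = Sentiment.model_json_schema()
schema["additionalProperties"] = False  # strict structured outputs require this

response_format = {
    "type": "json_schema",
    "json_schema": {"name": "sentiment", "strict": True, "schema": schema},
}

with open("prompts.txt") as prompts, open("batch_input.jsonl", "w") as out:
    for i, prompt in enumerate(line.strip() for line in prompts if line.strip()):
        request = {
            "custom_id": f"prompt-{i}",
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": "gpt-4o-2024-08-06",
                "messages": [{"role": "user", "content": prompt}],
                "response_format": response_format,
            },
        }
        out.write(json.dumps(request) + "\n")
```

The resulting batch_input.jsonl gets uploaded to the Batch API, and the completed responses would then be paired back up with the prompts to produce the fine-tuning file.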
Has anyone tried fine-tuning with the new (Aug 6th) structured outputs yet? If so, maybe you can answer my question:
Do you need to include the response_format schema in the user messages in your fine-tuning JSONL? Or should it just be the plain system/user/assistant messages, even if a response_format schema is being used in the background? I think the key issue is that the gpt-4o output is affected by the field descriptions in the response_format definition, so how can the fine-tune capture that aspect of the prompting unless the schema is included as part of the user message in the training file? The two shapes I'm weighing are sketched below.
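To make the question concrete, here are the two training-example shapes I'm deciding between. This is purely illustrative; the system prompt, example text, and schema placeholder are made up, and I don't know which shape is actually correct for fine-tuning with structured outputs.

```python
# Illustrative only: the two candidate shapes for one line of the fine-tuning JSONL.
import json

# Placeholder for the JSON schema generated from the Pydantic model,
# including its field descriptions.
schema_text = "<JSON schema from the Pydantic response model>"

# Option A: plain messages; the schema is only supplied via response_format
# when the fine-tuned model is later called.
example_a = {
    "messages": [
        {"role": "system", "content": "Classify the sentiment of the user's text."},
        {"role": "user", "content": "The battery life is fantastic."},
        {"role": "assistant", "content": '{"label": "positive", "confidence": 0.94}'},
    ]
}

# Option B: the schema (and therefore its field descriptions) embedded
# directly in the user message of every training example.
example_b = {
    "messages": [
        {"role": "system", "content": "Classify the sentiment of the user's text."},
        {
            "role": "user",
            "content": f"Respond using this JSON schema:\n{schema_text}\n\n"
                       "Text: The battery life is fantastic.",
        },
        {"role": "assistant", "content": '{"label": "positive", "confidence": 0.94}'},
    ]
}

# Each training example would be one json.dumps(...) line in the JSONL file.
print(json.dumps(example_a))
print(json.dumps(example_b))
```

Option A keeps the training file clean but seems to lose the field-description signal; Option B preserves it but bloats every example with the schema. If anyone has tried either, I'd love to hear which one actually worked.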