Fine-tuning gpt-4o-2024-08-06 with images?

Rednas07 · August 21, 2024, 3:31pm

I am trying to extract complex data from images. Normally, I include examples in my API request, but that takes a lot of tokens.

I am now trying to fine-tune the gpt-4o-2024-08-06, giving examples of how to answer those images:

jsonl file:
{“messages”: [{“role”: “system”, “content”: “My explenation”}, {“role”: “user”, “content”: [{“type”: “image_url”, “image_url”: {“url”: “data:image/jpg;base64, base64_string”}}, {“type”: “text”, “text”: “Do that with this picture”}]}, {“role”: “assistant”, “content”: “Correct answer”}]}
{“messages”: [{“role”: “system”, “content”: “My explenation”}, {“role”: “user”, “content”: [{“type”: “image_url”, “image_url”: {“url”: “data:image/jpg;base64, base64_string”}}, {“type”: “text”, “text”: “Do that with this picture”}]}, {“role”: “assistant”, “content”: “Correct answer”}]}

But I get this error: “The job failed due to an invalid training file. Invalid file format. Please remove all mentions of ‘image_url’ from your file and try again.”

Is it even possible to fine-tune a model with images? What am I doing wrong here?

sps · August 21, 2024, 3:33pm

Hi @Rednas07

As of writing this, fine-tuning gpt-4o-mini is only for text inputs and outputs.

jonathan.roley · October 3, 2024, 1:35pm

@Rednas07 this functionality is now available as of October 1st. Here’s the announcement. https://openai.com/index/introducing-vision-to-the-fine-tuning-api/

Topic		Replies	Views
Fine-tuning gpt-4o on image data API fine-tuning , fine-tune	8	547	October 23, 2024
Multimodal (image) fine tuning with GPT-4 API gpt-4 , fine-tuning	17	5762	October 3, 2024
Can I use images with fine-tuned model API image-reading , gpt-4o-mini	4	56	October 10, 2024
Question on Finetuning: Can you hardcode images or upload image responses via the image_url subkey of content? API	3	116	August 27, 2024
GPT4O finetuning with vision capabilities API	2	916	July 24, 2024

Fine-tuning gpt-4o-2024-08-06 with images?

Related topics