Fine-tuning GPT-4o Mini with image data

ramkumarkoppu · August 9, 2024, 9:26pm

There are no specific details available about fine-tuning GPT-4o Mini with image data in the https://platform.openai.com/docs/guides/fine-tuning, if it is a multimodal model that supports both text and vision tasks

trenton.dambrowitz · August 9, 2024, 9:30pm

Correct, at this time vision fine-tuning isn’t publicly available for any OpenAI models. Hopefully this will change soon!

bofogert · August 12, 2024, 7:48pm

Trying to finetune gpt-4o-mini with images in the training data currently gives the following error

The job failed due to an invalid training file. Invalid file format. Please remove all mentions of ‘image_url’ from your file and try again.

So, it seems using Vision is currently not supported. OpenAI should document this somewhere.

quirk · August 20, 2024, 9:09pm

It seems like they’re working on this in general, but haven’t mentioned images yet:
https://openai.com/index/gpt-4o-fine-tuning/

Topic		Replies	Views
Fine-tuning gpt-4o on image data API fine-tuning , fine-tune	9	1146	November 29, 2024
Fine-tuning gpt-4o-2024-08-06 with images? API fine-tuning	2	1483	October 3, 2024
Fine-tuned model on GPT 4o-mini can't use vision API fine-tuning	1	254	July 26, 2024
Vision on fine-tuned models API fine-tuning	8	328	March 3, 2025
GPT4O finetuning with vision capabilities API	2	1009	July 24, 2024

Fine-tuning GPT-4o Mini with image data

Related topics