Technically gpt-4o is now available for fine-tuning. However, just like in the past with gpt-4, you must request access and describe you intended use case via a dedicated form to then maybe get access to it.
Only users with a decent track record of fine-tuning are given the option to request access though. You can check in the fine-tuning UI whether if you are eligible.
In case anyone else sees this before trying it, I uploaded a training dataset with image input and it failed with the message The job failed due to an invalid training file. Invalid file format. Please remove all images from your examples and try again.
Below is an example of the data format I am working with:
{
“messages”: [
{“role”: “user”, “content”: “What does this image represent?”, “image”: “data:image/jpeg;base64,<encoded_image_data>”},
{“role”: “assistant”, “content”: “This is the logout button, which means ‘sign out’.”}
]
}
In this example:
<encoded_image_data> represents the Base64 encoded string of the image.
The intent is to allow the model to process the image alongside the text input, then generate a relevant text-based response.
However, as indicated, it is currently not possible to fine-tune with images .