RL Finetuning with Text + Image Data - Image Data not Supported

VLMFinetuner · September 11, 2025, 1:31am

Reinforcement Fine-Tuning with Images - Getting “method does not support it” Error

Problem Description

I’m trying to use OpenAI’s reinforcement fine-tuning API with multimodal data (text + images). However, I’m encountering an error that seems contradictory to the API documentation.

Error Message

The job failed due to a file format error in the training file. 
Invalid file format. Input file <file id> contains images, 
but the method `reinforement` does not support it.

Documentation Confusion

The OpenAI reinforcement fine-tuning documentation states:

“Input messages may contain text or image content only. Audio and file input messages are not currently supported for fine-tuning.”

This suggests that images should be supported, but the error message indicates otherwise.

My Data Format

I’m formatting my training data in JSONL with the following structure:

{
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "prompt..."
        },
        {
          "type": "image_url",
          "image_url": {
            "url": "https://..."
          }
        },
        ...
      ]
    }
  ],
}

Questions

Does reinforcement fine-tuning actually support images? The documentation suggests yes, but the error suggests no.
Is the image format different for reinforcement fine-tuning vs. simple API calls? I’ve structured my messages to match regular API calls, but not sure if for finetuning the format needs to be handled differently.

Has anyone successfully used images with reinforcement fine-tuning? Or is this a known limitation that’s not clearly documented?

Any guidance or workarounds would be greatly appreciated. If images aren’t supported, it would be helpful if the documentation could be updated to clarify this limitation.

Thank you!

Topic		Replies	Views
Fine-tuning gpt-4o on image data API fine-tuning , fine-tune	9	1433	November 29, 2024
Fine-tuning gpt-4o-2024-08-06 with images? API fine-tuning	2	1722	October 3, 2024
Question on Finetuning: Can you hardcode images or upload image responses via the image_url subkey of content? API	3	284	August 27, 2024
Fine-tuning GPT-4o Mini with image data API gpt-4o-mini	3	1769	August 20, 2024
Issue: gpt-4.1-nano fine-tuned model cannot analyze images - blocked by endpoint validation Bugs gpt-4 , gpt-41	17	1086	August 11, 2025