I am trying to fine-tune GPT-4o-2024-08-06 for image recognition. I have training jsonl file with 20 examples and validation jsonl file with 10 examples. Each example is in the required compact format according to the vision fine-tuning documentation. I am using image_url instead of uploading the images encoded in base64. Every time I create the fine-tuning job, I get the following error message: “Training file has 0 example(s), but must have at least 10 examples”.
Here is an example:
{“messages”: [{“role”: “system”, “content”: “You are an assistant that identifies objects.”}, {“role”: “user”, “content”: “What is this object?”}, {“role”: “user”, “content”: [{“type”: “image_url”, “image_url”: {“url”: “url/image01.jpg”}}]}, {“role”: “assistant”, “content”: “It’s a book”}]}
No, I manually checked. In fact, I am able to fine-tune with these images in base64, but not when I use the URL. I am using a public GitHub repo to store the dataset.