Fine Tuning for Vision models

_j · March 2, 2025, 2:18pm

Uploaded image files in storage by ID is only a vision option for the Assistants API.

Fabricating your own type of JSON will not work.

https://platform.openai.com/docs/guides/fine-tuning#vision

Images can be provided either as HTTP URLs or data URLs containing base64 encoded images.

You can prepare your images by downsizing them so the shorter dimension is maximum 768 pixels, or for detail:low, downsize so the largest dimension is 512 pixels or below, along with the optimum compressed file format for the type for the sake of transmission (or you might use double-size jpg if pixel-level color is important). This is the same as is done server-side when sending at inference time, and will make your file upload smaller.

Topic		Replies	Views
Fine-tuning gpt-4o-2024-08-06 with images? API fine-tuning	2	1267	October 3, 2024
Fine-tuning gpt-4o on image data API fine-tuning , fine-tune	9	919	November 29, 2024
GPT4O finetuning with vision capabilities API	2	993	July 24, 2024
Fine-tuning fails due to zero examples Bugs gpt-4	5	96	October 15, 2024
Vision Fine Tuning - Great News but I'm still unable to upload images to the Fine tuned model? API fine-tuning-vision	4	166	October 4, 2024

Fine Tuning for Vision models

Related topics