Hi there. I’m trying to fine-tune an AI model’s behavior using few-shot examples with OpenAI’s chat.completions.create endpoint, but I’m struggling to structure it properly.
The goal is to have the model process an input image (that a user would provide) and respond with a specific textual output based on the image. I want to include a few examples to guide the AI’s reasoning and response style.
Does anyone have experience structuring this type of image-to-text few-shot setup? Specifically:
Where should I place the examples for the most effective guidance?
How do I reference image inputs effectively in prompts (e.g., placeholders or descriptions)?
Let’s call this “optimizing” rather than fine-tuning, since “fine-tuning” is a term reserved for further model training.
The current vision-capable models work better when instructed on a task, with examples framed as part of that instruction-following. However, the API does not allow you to place images in a system message.
Therefore, the only option is to provide example user inputs and assistant responses of the quality and perception you want reproduced. Be aware that the model will treat these as chat that actually happened, and heavily chat-trained models tend to partially disregard such turns: they have less propensity to pick up the demonstrated skill and more propensity to answer about it.
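A minimal sketch of that structure, assuming a vision-capable model such as gpt-4o and placeholder image URLs (both the model name and the URLs are assumptions; substitute your own):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

messages = [
    # Task instruction (no images allowed here)
    {"role": "system", "content": "Caption the user's image in one short line."},
    # Few-shot example: a user turn containing an image, then the ideal assistant reply
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image."},
            {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},  # placeholder
        ],
    },
    {"role": "assistant", "content": "A tabby cat asleep on a sunny windowsill."},
    # The real input follows the same shape as the example turn
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "Describe this image."},
            {"type": "image_url", "image_url": {"url": "https://example.com/real-input.jpg"}},
        ],
    },
]

response = client.chat.completions.create(model="gpt-4o", messages=messages)
print(response.choices[0].message.content)
```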
There is one modification you can make, though: use the name field on both roles. Give the example messages names like “exampleIn” and “exampleOut”, and they will be set apart from the real user input and responses. Your system message can then remain instructional instead of relying purely on in-context pattern training, e.g. “always follow the style of response demonstrated by assistant:exampleOut messages, placed by the developer”.
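Continuing the sketch above, the same example turns with name fields and a system instruction that points at them (the names and wording are illustrative, not required values):

```python
messages = [
    {
        "role": "system",
        "content": "Caption the user's image in one short line. Always follow the style "
                   "of response demonstrated by the assistant exampleOut messages, "
                   "placed by the developer.",
    },
    # Named example pair, distinguishable from the real conversation
    {
        "role": "user",
        "name": "exampleIn",
        "content": [
            {"type": "text", "text": "Describe this image."},
            {"type": "image_url", "image_url": {"url": "https://example.com/cat.jpg"}},
        ],
    },
    {"role": "assistant", "name": "exampleOut", "content": "A tabby cat asleep on a sunny windowsill."},
    # ...the real user turn goes here, with no name field...
]
```

Note that the name field accepts only simple identifiers (letters, digits, underscores; no spaces or colons), so the “assistant:exampleOut” wording is just how the instruction text refers to those messages, not the value of the name itself.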