GPT4-V: the order of multiple image inputs

rsomani95 · January 17, 2024, 3:05pm

I’ve been facing this same problem. I thought maybe we could interleave image inputs with text but the API doesn’t seem to like that.

My content was set up as follows:

PROMPT_MESSAGES = [
    {
        "role": "user",
        "content": [
            {
                "type": "text",
                "text": "Here are a few images I have on hand. I'd like you to pick the most appropriate one for a Christmas greeting card I'm sending out on behalf of my family."
            },

            {
                "type": "text",
                "text": "This is image #1"
            },
            {
                "type": "image_url",
                "image_url": image_to_base64(img1)
            },

            {
                "type": "text",
                "text": "This is image #2"
            },
            {
                "type": "image_url",
                "image_url": image_to_base64(img2)
            },
        ],
    },
]

In response, I received the “I’m sorry, I cannot assist with these requests.” reply that others in the forum have gotten for different reasons.
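For anyone who wants to reproduce this layout, here is a minimal sketch of the same interleaved text/image structure. It assumes the documented Chat Completions shape for image parts, where `image_url` is an object with a `url` field holding a base64 data URL. The helpers `to_data_url` and `build_labeled_content` are hypothetical names of my own, not part of any SDK, and the byte strings are placeholders rather than real images. Nothing in this thread confirms that this avoids the refusal; it only shows the structure.

```python
import base64


def to_data_url(image_bytes: bytes, mime: str = "image/jpeg") -> str:
    """Encode raw image bytes as a base64 data URL (hypothetical helper)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime};base64,{encoded}"


def build_labeled_content(instruction: str, images: list[bytes]) -> list[dict]:
    """Interleave a numbered text label before each image part.

    The resulting order is: instruction, label 1, image 1, label 2, image 2, ...
    """
    content = [{"type": "text", "text": instruction}]
    for index, image_bytes in enumerate(images, start=1):
        content.append({"type": "text", "text": f"This is image #{index}"})
        content.append(
            {
                "type": "image_url",
                # Assumption: image_url is an object with a "url" key.
                "image_url": {"url": to_data_url(image_bytes)},
            }
        )
    return content


# Placeholder bytes stand in for real image files read from disk.
PROMPT_MESSAGES = [
    {
        "role": "user",
        "content": build_labeled_content(
            "Pick the most appropriate image for a Christmas greeting card.",
            [b"placeholder-image-1", b"placeholder-image-2"],
        ),
    }
]
```

The resulting `PROMPT_MESSAGES` list would be passed as the `messages` argument of a chat completion request. Building the list in a loop keeps each label directly before the image it refers to, however many images are sent.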
