Cannot send image_url to gpt-4o

kevrab · May 19, 2024, 1:09am

I’m trying to send image_url under ‘user’ role to gpt-4o. However every time I send it, it complains with that the model does not support image_url:

Invalid content type. image_url is only supported by certain models.

I’ve tried other models like gpt-4-turbo, but every time it gets rejected. I’m on Tier 1 usage currently which appears to allow me to use these models, but is there something else I’m missing? Or is this a bug?

nick.youngblut · May 27, 2024, 7:33pm

I’m getting the same (openai 1.30.3).

My message content:

{'type': 'image_url', f'data:image/png;base64,{base64_image}'}

I’m using base64 png images pulled from a Jupyter notebook.

My chat completions code:

stream = client.chat.completions.create(
    model="gpt-4o",
    messages=my_messages,
    stream=True
)

for chunk in stream:
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")

The response:

BadRequestError: Error code: 400 - {'error': {'message': 'Invalid content type. image_url is only supported by certain models.', 'type': 'invalid_request_error', 'param': 'messages.[1].content.[1].type', 'code': None}}

iiitmahesh · May 27, 2024, 8:01pm

is there way to upload files like gpt-4o ui through API?

sashirestela · May 27, 2024, 8:18pm

That functionality is working by my side.

First I’m creating both an Assistant (with gpt-4o) and a Thread (empty).

Next, I’m hitting this endpoint:

POST https://api.openai.com/v1/threads/{threadId}/messages
Body:

{
  "role": "user",
  "content": [
    {
      "type": "text",
      "text": "Do you see any similarity or difference between the attached images?"
    },
    {
      "type": "image_file",
      "image_file": {
        "file_id": "file-Vl3rOrhupx0MFHwsVjvjva4S",
        "detail": "low"
      }
    },
    {
      "type": "image_url",
      "image_url": {
        "url": "https://upload.wikimedia.org/wikipedia/commons/e/eb/Machu_Picchu%2C_Peru.jpg",
        "detail": "low"
      }
    }
  ]
}

Then, I’m hitting this other endpoint:

POST: https://api.openai.com/v1/threads/{threadId}/runs
Body:

{
  "assistant_id": "asst_kfoHDe4qk8JwqOwtM9PdROJN",
  "stream": true
}

And I’m receiving this response in chunks of text:

Both images appear to be identical, depicting the iconic site of Machu Picchu in Peru. Machu Picchu is a 15th-century Inca citadel located in the Eastern Cordillera of southern Peru. It is set on a mountain ridge and is situated approximately 2,430 meters (7,970 feet) above sea level.

Similarities:
1. Both images show the same panoramic view of the Machu Picchu archaeological site.
2. The notable features such as the terraces, stone structures, and the prominent Huayna Picchu mountain in the background are visible in both images.

Since the images are the same, there are no differences to point out. This preserved Incan site demonstrates remarkable engineering and architectural skills, and it's a significant cultural heritage of the Inca civilization.

Just in case, I’m using the simple-openai library.

Topic		Replies	Views
Gpt-4-turbo-2024-04-09 not accepting images? Bugs api	8	3220	April 10, 2024
Unable to send images to gpt-4-turbo or gpt-4o API	2	1333	May 17, 2024
GPT-4o API bug: Can't take in image_url from assistant in messages, only user Bugs gpt-4o	3	950	May 29, 2024
Gpt-4-turbo-2024-04-09 not accepting images in thread history Bugs gpt-4 , assistants-api	17	1744	May 13, 2024
GPT-4o Error: Image URLs in System Messages API gpt-4o	4	1957	June 14, 2024

Cannot send image_url to gpt-4o

Related Topics