Can ChatGPT analyze images using the API?

SB_Access · July 26, 2024, 9:34pm

I’ve seen conflicting info about this. I know that ChatGPT can analyze images that are manually uploaded into the chatgpt web interface, but is there a way to analyze images with API prompts?

I am using a Javascript node project and I have been able to successfully upload image files to ChatGPT and retrieve their IDs using the API, but when I ask ChatGPT to analyze the images I get responses like “I’m sorry, but I don’t have the capability to analyze images directly. However, I can help you with a description or analysis of the content if you provide a detailed description of the image.”.

I’ve tried using different models and I get the same response with all of them. Is this feature unavailable, or am I approaching it wrong? Here is a snippet of the code that I’m using with the latest openai node package:

const messages = [
  { role: 'system', content: 'Analyze the following images.' },
  { role: 'user', content: 'Image ID: file-12345' } // Placeholder file ID
];

const completion = await openai.chat.completions.create({
  messages: messages,
  model: "gpt-4",
});

Thanks!

supershaneski · July 26, 2024, 11:27pm

You need to use the vision format for the messages and use gpt-4-turbo, gpt-4o or gpt-4o-mini models.

const messages = [
  { role: 'system', content: 'Analyze the following images.' },
  {
        role: "user",
        content: [
          { type: "text", text: "What’s in this image?" },
          {
            type: "image_url",
            image_url: {
              "url":  your_file, // either url (not local) or base64. file id is used only in assistants api.
            },
          },
        ],
      },
];

const completion = await openai.chat.completions.create({
  messages: messages,
  model: "gpt-4o-mini", // use model that can do vision
});

Topic		Replies	Views
Do ChatGPT API supports image describe functionalities via API? API gpt-4 , gpt-35-turbo , chatgpt , api	1	722	July 16, 2024
What are the APIs for image analysis? API gpt-4 , api	2	10237	May 17, 2024
Title: How to Enable Image and PDF Analysis with ChatGPT API Version 1.33.0 API gpt-4o	1	599	July 6, 2024
Unable to directly analyze or view the content of files like (local) images API chat-completion , gpt-4-vision	3	859	November 7, 2024
Which is correct model for image analysis? API	5	278	December 9, 2024

Can ChatGPT analyze images using the API?

Related topics