GPT-4 API and image input

Hi there,

Is there a documented way to supply GPT-4 API with images?

I couldn’t find anything on OpenAI’s website.


Looks like image inputs will come at a later time. This is what OpenAI’s documentation page says:
"GPT-4 is a large multimodal model (accepting text inputs and emitting text outputs today, with image inputs coming in the future) that can solve difficult problems with greater accuracy than any of our previous models, thanks to its broader general knowledge and advanced reasoning capabilities. Like gpt-3.5-turbo, GPT-4 is optimized for chat but works well for traditional completions tasks."


GPT-4 runs on the same ChatGPT platform as GPT-3.5, where Plus subscribers got access to GPT-4 almost immediately after launch. But what about those who have API access? I just got mine, and I’m thinking of building a Discord bot to try the “same” tests that OpenAI demonstrated with GPT-4. Does anyone know anything about it?


I swear that I used “Describe this image to me” and then pasted the URL of an image, and GPT-4 described the image perfectly to me earlier this morning. Way better than I thought it would. Then I tried for a long time and couldn’t get it to work again. Describing images back to me is the main thing I want to do.


It was inferring the image contents from the URL.


For now you can look at Visual GPT:

It hooks into a third-party image interpreter. It can work for some things, but I assume GPT-4’s image recognition will be far more in-depth.
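The general pattern behind tools like this can be sketched roughly as follows (all the function names here are hypothetical, just to show the data flow): a separate captioning model turns the image into text, and that text is spliced into the prompt sent to the language model.

```python
# Hypothetical sketch: bridging images to a text-only model by
# captioning the image first, then prompting with the caption.
def describe_with_llm(image_path, caption_model, chat_model):
    caption = caption_model(image_path)  # e.g. "a tree in front of a sunset"
    prompt = f"The user uploaded an image described as: {caption}. Describe it."
    return chat_model(prompt)

# Stub models, just to demonstrate the flow:
reply = describe_with_llm(
    "photo.jpg",
    caption_model=lambda p: "a tree in front of a sunset",
    chat_model=lambda q: f"(model answer based on: {q})",
)
```

The language model never sees pixels; it only sees whatever text the interpreter produced, so the quality depends entirely on the captioner.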


That is partly true. On that initial test, I uploaded a random image that I found on Google, and it happened to be a tree in front of a sunset. When I asked GPT to describe the image, it described a sunset and a tree, and I assumed it actually worked.

Then when I tried again later, I got a mixture of three responses. One, it would tell me that it can’t look at images. Two, it would guess what the image was based on the URL. Three, it would describe a sunset and a tree.

For whatever reason, no matter what image I linked to, it would describe it as a sunset and a tree. It was a random coincidence that I uploaded a picture of a sunset and a tree, which appeared to work.


You should consider the possibility that this is such a great desire for you that you daydreamed it :wink:

have you seen the movie Contact? :smiley: a beach with a tree… :smiley: maybe AI only sees a sunset and a tree in any dataset.


I don’t understand. Where did you upload it? Is there an online platform?

This person from four months ago likely just provided the AI a web link, and then, as he describes, either the AI recognized a well-known link from its knowledge base, or it hallucinated the contents of /cute/kitten_playing.gif from the title.

In fact, one can discover this by reading:

You could give the AI a fake URL from a news site, but replace the URL’s description with your own words, and it would “summarize” a whole story just based on the contents of the link.
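To illustrate the test being described (the URL and prompt below are made up): the only information a text-only model can actually use is the words in the link itself.

```python
# Made-up URL: the path words are the only "content" the model sees.
fake_url = "https://news.example.com/2023/05/mayor-opens-robot-bakery"
prompt = f"Summarize this article: {fake_url}"
# A text-only model can't fetch the page; it will often invent a
# plausible story about a mayor opening a robot bakery from the slug.
```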

Computer vision is still not released.

If you have access to the ChatGPT code interpreter, you can now upload files to its Python environment, to be manipulated with Python commands.

I found AIHub by Instabase, which takes documents and allows ChatGPT to read them.


It was just made available yesterday. Yippee!


Is it not generally available for API users, or at least not documented?


Has anyone figured out how to supply an image to an API call and ask questions about it? ChatGPT is able to do it now.


I’ve been trying to figure out how to supply an image to the API but I haven’t got it working yet. Even the following does not work:

response = openai.ChatCompletion.create(
    model="gpt-4",
    messages=[{"role": "system", "content": "You are a helpful assistant."},
              {"role": "user", "content": f"what is this image {img_url}"}])

output: “I’m sorry for the inconvenience. As a text-based AI, I don’t have the ability to view or interpret images. You may want to use an image search engine or AI that specializes in image recognition for assistance.”

Hopefully someone can figure it out.


You can stop wasting your time asking and looking. There is no announced date for API availability.

Plus and Enterprise users will get to experience voice and images in the next two weeks. We’re excited to roll out these capabilities to other groups of users, including developers, soon after.

When the API starts taking lists instead of strings as “content”, then one might conclude something is going on. When you get the response “Sorry, I can’t help with that”, then you know it’s working as designed.
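For what it’s worth, here is a purely speculative sketch of what list-style "content" might look like if and when it arrives. Every field name below (`type`, `image_url`) is a guess on my part, not documented API:

```python
# Speculative sketch only: field names are guesses, not documented API.
message = {
    "role": "user",
    "content": [  # a list of parts instead of a plain string
        {"type": "text", "text": "What is in this image?"},
        {"type": "image_url", "image_url": {"url": "https://example.com/photo.png"}},
    ],
}
```

The point is simply that a string can only carry text, while a list can mix typed parts, which is why a change in the accepted type of "content" would be the tell.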


Amazing! But I was wondering what the syntax is for uploading PNG files as context. Is there documentation outlining it anywhere?

This is not possible right now. You have to wait, then the documentation will reflect how to do it.
