GPT-4 API multimodal access (images)

samgreenberg25 · March 31, 2023, 2:00am

I can’t find much about the multimodal capabilities of GPT-4. I have access to the “gpt-4” model via the API, but I don’t think it can ingest images. Is the multimodal model different, and if so when might it be available? Or is “gpt-4” multimodal and I just can’t find any documentation on that aspect.

sps · March 31, 2023, 3:46am

rick_zhu · September 27, 2023, 6:36am

Considering the recent update: ChatGPT can now see, hear, and speak, where saying: “Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms.”
Have anyone figured out how to use this multi-modal capability from API? I haven’t see any update related to this in the API documentations.

dmalex · September 29, 2023, 4:18pm

Follow this thread as well. Normally API should be released to dev community before official service release.

stepheneliotdewey · November 1, 2023, 7:12pm

I’m also curious if it is possible to use the API to provide an image and get a summary. I want to use this for a project.

tjmcdonough · November 1, 2023, 7:13pm

Same here. It works amazingly through the browser, we need API access

alessandro.moscato · November 8, 2023, 10:16am

Any news on this? It has been announced on the DevDay but I don’t see anything related to it in the API docs. Am I missing something?

Fusseldieb · November 8, 2023, 11:25am

GPT4-Vision is available only through the API, for now. With API I mean code. It’s not in the playground yet.

Also, if you don’t have access to it even through API code, top up your account with at least $0.50 and it should unlock GPT-4 and probably Vision, too.

Topic		Replies	Views
GPT4-Vision: Will there be API access? API	5	6172	November 7, 2023
GPT-4 API for image input API gpt-4 , api	3	2913	November 6, 2023
Image input for GPT-4 (and related docs) API	1	2350	March 28, 2023
Access to GPT4 vision API API api	7	3876	February 28, 2024
How do I use images with the gpt-4 api? API gpt-4	1	1008	August 27, 2023

GPT-4 API multimodal access (images)

Related topics