Image inputs in the GPT-4 API

toni.petrov · September 26, 2023, 11:54am

I saw the announcement here - Image inputs for ChatGPT - FAQ | OpenAI Help Center

Image inputs are being rolled out in ChatGPT (Plus and Enterprise).

Still image inputs are not being rolled out in the API (https://platform.openai.com/), is that correct?

anon34024923 · September 26, 2023, 1:04pm

Ditto to this but for the Voice recognition, really want to use the voice conversation feature in my app

Thiago · September 26, 2023, 1:42pm

“We’re rolling out voice and images in ChatGPT to Plus and Enterprise users over the next two weeks. Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms.”

edit: wrong section was copy and pasted, here is what was ment to be posted: “Plus and Enterprise users will get to experience voice and images in the next two weeks. We’re excited to roll out these capabilities to other groups of users, including developers, soon after.”

anon34024923 · September 26, 2023, 1:52pm

that doesn’t help answer anything on the API release

_j · September 26, 2023, 1:56pm

Voice transcription features can be implemented by using the Whisper API.

You can monitor these official announcements, found on the OpenAI blog, Twitter, and Discord, and update us.

There are no insiders spilling unannounced information here.

Thiago · September 26, 2023, 1:58pm

Ah just re read what was copied and pasted, hang on.

Thiago · September 26, 2023, 2:00pm

“Plus and Enterprise users will get to experience voice and images in the next two weeks. We’re excited to roll out these capabilities to other groups of users, including developers, soon after.”

toni.petrov · September 26, 2023, 2:17pm

Yes, I think that answers the question.

gerasim.sergey · October 11, 2023, 10:03am

hi, when will it be? 15 days have already passed

Foxalabs · October 11, 2023, 11:03am

It will be announced on this forum, social media and the OpenAI Blog when it becomes available, for now, please be patient.

oleksii.romanko · November 1, 2023, 3:29pm

Hi guys! Can you please provide any updates on when approximately developers API will be rolled out?

tanbeige · November 6, 2023, 10:13pm

Just came out with the preview version for API: New models and developer products announced at DevDay

Prod version hopefully soon

Ron0079 · November 8, 2023, 6:06am

The example code for inputting images can be found in the API Reference documentation：
POST https://api.openai.com/v1/chat/completions

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4-vision-preview",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What’s in this image?"},
                {
                    "type": "image_url",
                    "image_url": "https://upload.wikimedia.org/wikipedia/commons/thumb/d/dd/Gfp-wisconsin-madison-the-nature-boardwalk.jpg/2560px-Gfp-wisconsin-madison-the-nature-boardwalk.jpg",
                },
            ],
        }
    ],
    max_tokens=300,
)

print(response.choices[0])

Topic		Replies	Views
GPT4-Vision: Will there be API access? API	5	6010	November 7, 2023
How can we use the images feature added to ChatGPT reently with Open AI APIs? API gpt-4 , api	2	2214	December 19, 2023
ChatGPT can do Q&A on images, but did not find this feature in API API	2	1420	January 31, 2024
When will API support image/audio as input and output? API gpt-4 , chatgpt , api	1	1653	October 9, 2023
Image input for GPT-4 (and related docs) API	1	2277	March 28, 2023

Image inputs in the GPT-4 API

Related topics