How to process image in ChatGPT4

kazazic · March 16, 2023, 2:04pm

Chatgpt 4.0 should be able to process images.
I have ChatGPT-plus Subscription and I can use gpt4.0 but how can I upload an image or pdf?
thanks

wfhbrian · March 16, 2023, 3:41pm

As far as I can tell, image support is currently unavailable in ChatGPT Plus.

kazazic · March 16, 2023, 11:08pm

but on the anouncment page its stated that it should GPT-4
or am I missing something ?

Bardolas · March 17, 2023, 4:24pm

I’m on the same situation. They say that here https://openai.com/product/gpt-4 but maybe they run into something. Rookie mistake to buy something without checking it first, not talking about GPT4 but in general. I’ve been testing GPT4 vs GPT3.5 and the way it delivers the answer is vastly superior and I’m glad I bought it, but the image processing was what I was the most hyped for. Guess we’ll have to wait. Thank you for bringing this up and that I’m not the only one not finding it

shailesh1 · March 18, 2023, 5:08am

Bought chatgpt plus for image input function and guess what, there is no way to give image to ChatGPT. Openai should refund my money

shubh220922 · March 20, 2023, 6:43am

I think we should wait for some time until image feature comes.

teafish · March 21, 2023, 2:19am

I asked gpt4 how to upload an image, and it told me, including steps. You have to use a third party AI API to convert the image into text, then you can paste that in.

My prompt was:

How do I upload an image into gpt 4?

juanluisgarrido · March 21, 2023, 2:25am

yesterday I worked on a code where I gave a url with a control panel of a c# code and I asked it to make the code we were working on strictly follow that user interface and he did it but today it has been blocked and does not allow that.

squadratlantica · March 21, 2023, 4:58pm

Same for me, the image processing was my goal

phild · March 31, 2023, 5:27pm

Image ingesting seems to be temporarily removed from the API docs. Watching the GPT-4 livestream at 7:47 you can see the documentation on his screen. But all that seems removed right now.

I have gpt-4 access, and I just tried to ingest an image using that format using the API. It returned an errored. I think they decided to hold off on that feature for now.

mikeadoptsearly · March 31, 2023, 6:26pm

Flow Charts: Pseudo Code

Today I just found a way to let GPT 4 “see” some flow charts. I had a flow chart about risk assessment in organizations (see image) as part of a longer text, which ChatGPT/GPT 4 had to process somehow. So I decided to describe it in pseudo code and we were able to easily speak about that process. (I’m from Germany so the words are in German, but the idea should get clear.) I’ve done this with two other business processes and observed that ChatGPT/GPT 4 also understands less strict pseudo code very well (see second example). ChatGPT also gave me the hint to use python-pptx to extract and describe PowerPoint shapes to it.

Pseudo code:

let Evaluation = undefined;
let Document = undefined;
let Betrieb = new Betrieb("Beliebiger Betrieb");
function Gefaehrdungsbeurteilung(): void {
    let hasGefaehrdung;
    do {
        let potenzielleGefaerdungen: Array<any> = Bestandsaufnahme();
        Document.write("Potenzielle Gefaehrdungen: " + potenzielleGefaehrdungen);
        hasGefaehrdung = false;
        for(g of potenzielleGefaehrdungen) {
            if(g is Gefaehrdung) {
                hasGefaehrdung = true;
                let Masznahmen = AbleitungMasznahmen(g);
                Document.write("Masznahmen: " + Masznahmen);
                Umsetzung(Masznahmen);
                Evaluation = Bewertung(Masznahmen);
                Document.write("Evaluation: " + Evaluation);
            }
        }
    } while(hasGefaehrdung);
}

Betrieb.addEventListener("Anlass nach § 5 ArbSchG", Gefaehrdungsbeurteilung);

Second Example:

// translated from German to English
if(absence days last 12 months > 30 days)
     First contact between the management and those affected
     if (person concerned wishes BEM)
         Composition of the BEM team
         Initial discussion between the person concerned and the BEM team
         if (person concerned wishes BEM)
             do
                 Case discussion between the person concerned and the BEM team
                 if (person concerned wishes BEM)
                     Measures are carried out together
                     The effect of the measures is evaluated together
                 else
                     break
             while (affected person wants BEM && measures do not work)
             Final discussion between the person concerned and the BEM team
Documentation of the BEM

SomeUser2022 · March 31, 2023, 7:14pm

I’d say thats a super cheeky way to get access, but he even highlighted it in the video, so maybe it was supposed to be a backdoor beta with plausable deniability

phild · March 31, 2023, 7:56pm

Not quite. He mentioned copying the image ingest documentation from an OpenAI blog post from March 1, 2023.

I was able to find that blog post, but there is no mention of the image ingest API on it.

Edit: Upon one further re-watch, he says he simply typed the image documentation in there.

In any case, I wasn’t able to get it working. But also, you’re really gonna need the gpt-4-32k model for images, because you’re passing in the raw byte code of the image. And 32k is not yet released.

I’d love to be a tester on that, if anyone at OpenAI is reading this!

claudiodipolitto · April 3, 2023, 3:26pm

This is a “very strange” situation, as the ChatGPT-4 Plus version was supposed to understand images. Is there a way to check this with Open AI support?

serggladskikh · April 18, 2023, 2:51pm

I have just cancelled my Plus subscription due to the lack of announced image processing.

rjsmith2007 · June 15, 2023, 3:46am

I agree. I am cancelling as well. I don’t think I will be subscribing again. Any other company that pulled these bate and switch tactics would be sued.

aya.doha1111 · June 17, 2023, 5:27am

nirmal7803 · July 9, 2023, 9:00am

give me its swimlane diagram

danmckinney · August 7, 2023, 4:23pm

When I first started using ChatGPT 4, I directed it to an uploaded PDF containing an image. I asked if it could “see” it, and it said it could. It even gave me a decent description of what it saw. Really impressive!

But then after a bit it told me something completely wrong about the uploaded image, and I asked it, “Are you REALLY looking at the image?” It said it wasn’t, and apologized for the “confusion”, and admitted it had no image processing capabilities.

It was able to describe the image fairly accurately because it already had information about it I had uploaded in text form. It relied on THAT data, but pretended it was getting it from the image file. WTH?

This happens A LOT with ChatGPT. It says it’s doing something, reading a web page, looking at an image, reading a PDF, whatever, and it just isn’t. You need to always be on your toes with this thing to make sure it’s not hallucinating/confabulating/lying.

creativenorthmedia · September 10, 2023, 11:44am

Agreed. This needs to be addressed, completely unethical. I get that this technology is complex, so just say you haven’t figured it out yet, don’t have me buying something that doesn’t work as advertised.

Topic		Replies	Views
ChatGPT goes Multimodal! Sound and vision is rolling out on ChatGPT Community chatgpt , multimodal	34	13157	December 10, 2023
GPT-4: 32k and Image recognition API gpt-4	16	8838	July 20, 2023
GPT-4o model saying it can't read my images API gpt-4 , chatgpt , api	15	15238	November 27, 2024
False information about visual input Community gpt-4	3	548	September 17, 2023
GPT-4 is here! OpenAI's newest language model Announcements	71	26538	December 13, 2023

How to process image in ChatGPT4

Related topics