GPT-4-VISION forgets image data?

hello44 · November 20, 2023, 2:25am

I’m using the gpt-4-vision-preview API. I’m passing images to the API using data URLs. I’m also passing all the previous messages (message history).

It recognizes the image at first and describes it. If I follow up with another question about the image it will say it cannot read images or to provide it again. But if i ask again, it will recognize the image.

So it seems like the model can look back in the chat and read the image again but it is not always aware of it. Is this normal? Is it just a matter of giving it a system message so it does not to do this?

Spongiform · November 20, 2023, 2:55am

On the web version, it has a tendency to straight up forget that it can analyze images right in the middle of a conversation. At which point you have to remind it gently that it can in fact do it and to “just try it”.

It helps if your image has some sort of unique code to identify it and use that to refer to it. ie: analyze image a5.

_j · November 20, 2023, 3:50am

Vision analysis costs you money every “look”.

If you are sending a URL, you are also relying on someone else to download and insert the image before the AI can answer.

Try putting the base64 encoded image (on your dime of course) into past conversation history turns for the user just as it was originally asked, count the non-stream tokens and see they are being “visioned” again even when not in the most present chat, and then go about asking about that chat history.

hello44 · November 20, 2023, 5:13am

Hi! What do you mean by unique code for identifying? We cannot add any metadata to images. Just the plain image and text along with it.

hello44 · November 20, 2023, 5:14am

Yes this is what I’m doing. Sometimes GPT remembers it can analyze the image, sometimes not.

Topic		Replies	Views
Text generation history forgets analyzed image API gpt-4 , api	0	73	January 9, 2025
GPT-4o forgets image data and sometimes gives answers that have nothing to do with the image API api , assistants-api , gpt4o	10	1295	June 15, 2024
4o & turbo models can't read images anymore API	5	2406	June 4, 2024
"Unfortunately, I am not able to assist with images that contain personal information.." - Vision API response Bugs gpt-4v , gpt-4-vision	1	1144	February 20, 2024
When does Vision process images exactly? API gpt-4-vision	0	167	May 28, 2024

GPT-4-VISION forgets image data?

Related topics