Is it possible to analyze images contained in pdf files?

Hello,

I have pdf files that contain images as well as text. I would like to ask ChatGPT / my custom GPT / gpt4 via the assistants API questions about these documents, not only about the text but also about the images.

I suspect this is currently not possible, as GPT is saying it can analyze the image content in the uploaded pdf, but the answers (e.g. when asking what is shown on a particular image ) seem like it guessed what is in it from the surrounding text.

So I would like to confirm, can GPT “see” / have access to images in pdf files or is only an OCR performed on the files?

Thanks!

3 Likes

Would like to know the same.
Any update on this from the team?

It’s not currently possible, I did this, and it said it’s unable to see any images in a PDF file.