Using images as context in a prompt

Hi all,

I’m working on a classification task: I want the model to identify whether the issue mentioned in a table is related to a specific topic. So far I have provided text-only context. I would now like to add images (.jpg files) containing the entire knowledge base on the topic (the images are PDF pages converted to JPG, and they contain figures, tables, and text).
Will the model be able to use this additional context?
Considering it is a pretty large base (~600 pages of PDF, hence ~600 image files), what is the optimal way of working?

  • Merge all the image files into one?
  • Input all the images individually?
  • Vectorize the images and follow a RAG architecture?
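For the RAG option, the retrieval step could look like this minimal sketch. It uses a toy pure-Python bag-of-words similarity over per-page text (assuming each page has already been OCR'd; a real pipeline would use a proper embedding model, and the `pages` data below is made up for illustration):

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words 'embedding': a term-frequency vector.
    A real pipeline would call an embedding model here instead."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items() if term in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_pages(query, pages, k=3):
    """Return the k page ids whose text is most similar to the query."""
    q = embed(query)
    scored = sorted(pages.items(),
                    key=lambda kv: cosine(q, embed(kv[1])),
                    reverse=True)
    return [page_id for page_id, _ in scored[:k]]

# Hypothetical per-page OCR output, keyed by page number
pages = {
    1: "valve maintenance schedule and torque table",
    2: "electrical wiring diagram of the control cabinet",
    3: "safety valve replacement procedure and spare part numbers",
}
print(top_pages("safety valve replacement", pages, k=2))
```

Only the top-k pages are then attached to the classification prompt, which keeps the context well under the ~600-page total.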

Thanks for taking the time

Welcome to the community!

You wanna upload images full of text?

Extracting the text first will probably yield better results :thinking:
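If you go the text-extraction route, a common follow-up step is to split each page's text into overlapping chunks so a sentence near a boundary is never cut off from its context. A rough sketch (the chunk sizes here are arbitrary):

```python
def chunk_text(text, size=500, overlap=100):
    """Split text into overlapping character chunks.
    The overlap keeps boundary sentences visible in two chunks."""
    if size <= overlap:
        raise ValueError("size must exceed overlap")
    chunks = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

sample = "a" * 1200
print(len(chunk_text(sample)))  # 3 chunks of up to 500 chars each
```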


Thanks for the welcome!

The images contain text, tables, and embedded figures, which is why extracting only the text seems limiting: the information from the tables and figures would be lost.

I personally wouldn’t rely on vision to reliably extract table data :confused:

You can try, but I wouldn’t expect amazing results, unfortunately. Maybe start with a page or two and see how ChatGPT responds.
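If you test those pages through the API rather than the ChatGPT UI, images are typically sent as base64 data URLs inside the message content. This sketch only builds the request payload, it doesn't send anything; check the exact content format and model names against OpenAI's current docs:

```python
import base64
import json

def image_message(jpg_bytes, question):
    """Pair a text question with one JPEG page in a single user message,
    using the data-URL image format accepted by vision chat APIs."""
    b64 = base64.b64encode(jpg_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {"type": "image_url",
             "image_url": {"url": f"data:image/jpeg;base64,{b64}"}},
        ],
    }

# Fake JPEG bytes stand in for a real page scan
msg = image_message(b"\xff\xd8\xff\xe0fake-jpeg-bytes",
                    "Is the issue on this page related to the safety valve?")
print(json.dumps(msg)[:80])
```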

How would you do the table extraction without vision?

Probably ABBYY or some other OCR tool

But even then, I’d probably normalize the table (into rows) before feeding it to the model
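For that normalization step, one approach is to flatten each extracted table row into a self-describing line of text, so the model never has to align cells across columns itself. A minimal sketch, assuming the OCR tool already gives you headers and cells (the example data is invented):

```python
def table_to_rows(headers, rows):
    """Turn a 2-D table into one 'Header: cell' text line per row."""
    lines = []
    for row in rows:
        pairs = [f"{header}: {cell}" for header, cell in zip(headers, row)]
        lines.append("; ".join(pairs))
    return lines

headers = ["Part", "Issue", "Topic"]
rows = [
    ["P-101", "leaking seal", "maintenance"],
    ["P-202", "wiring fault", "electrical"],
]
for line in table_to_rows(headers, rows):
    print(line)
```

Each line then reads like "Part: P-101; Issue: leaking seal; Topic: maintenance", which tends to survive chunking and retrieval better than raw grid layout.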