Vision Prompting Does not seem to be accurate

promptjudy · February 21, 2025, 11:27pm

Have people tried using vision models to perform PDF rag? What is the type of accuracy you are seeing? Even the latest models arent able to quite read pdf documents without actual text provided (OCR) - or is this a prompting issue?

For some reason it does not allow me to post a link to the run - but below I tried if you want to look at the prompt and tell me if this is a prompting issue

app_promptjudy_com/public-runs?runId=vision-retrieval-augmented-generation-1631582502-gpt-4o%23VMVNNCdEXlmKSWu7uN0ZA

I Send this prompt with 4 images of the links mentioned in the prompt and pretty much all the models do hallucinate on one or more questions. On the other hand, If i send the text of the pages, they all do great… Here is the text only version of the same prompt:

https://app_promptjudy_com/public-runs?runId=retrieval-augmented-generation–1385570120-gpt-4o-mini%23j9LH1lvUmgLQmNM5B22Vo

Below is the performance of vision vs non vision:

promptjudy · February 21, 2025, 11:29pm

For some reason it does not allow me to post a link to the run - but below I tried if you want to look at the prompt and tell me if this is a prompting issue

app_promptjudy_com/public-runs?runId=retrieval-augmented-generation–1385570120-gpt-4o-mini%23j9LH1lvUmgLQmNM5B22Vo

Topic		Replies	Views
How to add correct examples for image-to-text task Prompting gpt-4-vision	5	2496	December 29, 2023
Strange/Bad behavior of Open AI API with vision models API gpt-4 , api	7	986	February 24, 2025
GPT-4o-vision for extraction of complex tables Prompting gpt-4 , gpt-4o	0	561	March 8, 2025
Data points in tables and charts in images Prompting gpt-4	7	2187	April 17, 2025
The performance difference between ChatGPT4o and gpt4o api using the same prompt for image analysis API gpt-4 , chatgpt , gpt-4-vision , gpt4-vision , api-vision	5	1239	July 27, 2024

Vision Prompting Does not seem to be accurate

Related topics