How can I retrieve data from a PDF that was created from an image captured by a camera?

r.binayram · May 3, 2024, 12:27pm

Is there a way to retrieve text from a PDF created from a camera-captured image using the Assistants API?

nikunj · May 3, 2024, 9:16pm

Unfortunately there’s no way to do this at the moment – we don’t parse images in documents yet.

kennethologist · May 4, 2024, 1:58pm

Would recommend doing some pre-work and use a library to grab each page convert to image and feed the image to GPT Vision to give you the text. However I imagine doing some local OCR process would work as well and be cheaper.

dbafu1 · May 4, 2024, 2:06pm

Before now, I can upload images to poe using the gpt-4 model and get responses, but as at yesterday, this is no longer possible.
Also, same thing while using the api via openweb ui

Topic		Replies	Views
Retriever Assistant can't read scanned pdfs? API gpt-4 , api	7	2996	July 22, 2024
Train assistant to read PDF with images API gpt-4	8	2008	July 22, 2024
Assistants api with Images API	0	52	December 19, 2024
What is the API equivalent of uploading a PDF? API gpt-4o	1	5039	June 20, 2024
Using vision in Assistants and vector databases API assistants-api	3	250	August 25, 2024

How can I retrieve data from a PDF that was created from an image captured by a camera?

Related topics