I wanted to extract information from invoice using GPT-4o, which can be image or PDF

bisratx · September 6, 2024, 1:38pm

I have an API that accepts image or PDF invoice and extract the information from it and respond in json format.

In my previous implementation i’ve used Azure document processors (invoice) for extraction and Open AI API for customizing the response.

but now i wanted to switch fully to openAI api.

is there an API that can support this?

bisratx · September 6, 2024, 2:10pm

I’ve found this File uploads FAQ | OpenAI Help Center article, saying the API version for file upload will be available soon.

the article is posted a week ago, but please feel free to share if there is any latest news about it.

sps · September 7, 2024, 8:24am

If the goal is to simply extract info from a pdf invoice, you can do it with chat completions API using vision capability.

Just convert the uploaded PDF doc’s pages into image files with supported format and consume them over the vision modality to extract info you want.

jochenschultz · September 8, 2024, 7:34am

I am sorry, but is that an assumption or did you run a 600 page AWS invoice through it and got all the right values?

bisratx · September 18, 2024, 7:55am

converting the pdf to an image may add load time to the API which will make the time the same as the first approach

Topic		Replies	Views
How to Extract Data from Images Using OpenAI API? API gpt-4	1	489	October 18, 2024
Document Details Extraction API gpt-4 , assistants-api	1	626	October 13, 2024
OpenAI API to read commercial invoices API api	2	269	September 27, 2024
Make OpenAI Vision API Match GPT4 Vision API chatgpt	4	3391	December 6, 2023
Scanned pdf with API and ask questions API chatgpt , api	3	326	October 15, 2024