How to Extract Data from Images Using OpenAI API?

dev002.binarybrix · October 18, 2024, 3:05pm

Hi everyone,

I’m working on a project where I need to extract structured data (like invoice numbers, dates, vendor names, etc.) from images. I initially explored using the OpenAI API, but I encountered some challenges.

I understand that GPT-4 Vision can handle image inputs, but it appears that this functionality isn’t yet available through the OpenAI API. Is there a way to extract data from images using GPT-4 via the API, or should I use an alternative approach?

Here’s what I’m thinking of doing:

Extract text from the image using an OCR tool (e.g., Google Cloud Vision or Tesseract).
Send the extracted text to the GPT-4 API with a prompt to format and extract the relevant data (like invoice numbers, dates, etc.).

I’d love to know if anyone has:

Successfully used GPT-4 for this type of task.
Found workarounds or alternative methods for extracting structured data from images.
Any updates on when GPT-4 Vision might be available via the API.

Looking forward to your suggestions and advice. Thanks in advance!

EricGT · October 18, 2024, 3:35pm

Welcome to the forum.

If I understand you correctly then I did that the other day as an example for another problem, it even used an image of an invoice.

Note: This uses ChatGPT but the same should work for the API if the same model is used. Sorry I can not give you any working API code as I do not use that often but this should show that what you seek is doable.

Also check the OpenAI cookbook: https://cookbook.openai.com/

Topic		Replies	Views
Make OpenAI Vision API Match GPT4 Vision API chatgpt	4	3634	December 6, 2023
How to Programmatically Extract Text from Images Using GPT-4 API gpt-4 , chatgpt , api , assistants-api	9	5021	October 14, 2024
I wanted to extract information from invoice using GPT-4o, which can be image or PDF API gpt4o	4	848	September 18, 2024
OpenAI API to read commercial invoices API api	2	565	September 27, 2024
How to Process PDF Files with OpenAI's Tools and APIs for Invoice Automation? API api , gpt-4-vision , ocr	1	368	January 15, 2025

How to Extract Data from Images Using OpenAI API?

Related topics