OpenAI API for image text extraction

bonusad · November 13, 2023, 2:17pm

Hi guys,
Can I use current OpenAI API to upload jpeg or PDF file and extract contextual data in JSON format.
In our case we have scanned purchase bills which need to be parsed into our local database.

Foxalabs · November 13, 2023, 2:20pm

Hi and welcome to the Developer Forum!

Sounds like a task for Assistants, you can find more here:

bonusad · November 13, 2023, 2:21pm

Assistants API does not allow to access the processed file:
“Not allowed to download files of purpose: assistants”

Foxalabs · November 13, 2023, 2:23pm

Assistants is an API calling system, you can process data in any form and way you like, what files are you trying to access?

bonusad · November 13, 2023, 2:28pm

This is the path Im following:

STEP 1:
curl …/v1/files
-H “Authorization: Bearer {API_KEY}”
-F purpose=“assistants”
-F file=“@b2.pdf”

STEP 2:
curl …/v1/assistants
-u :{API_KEY}
-H ‘Content-Type: application/json’
-H ‘OpenAI-Beta: assistants=v1’
-d ‘{
“instructions”: “…”,
“tools”: [{“type”: “code_interpreter”}],
“model”: “gpt-4-1106-preview”,
“file_ids”: [“file-ID”]
}’

STEP 3:
curl …/v1/files/FILE-ID/content
-H “Authorization: Bearer {API_KEY}”

FINAL OUTPUT:
{
“error”: {
“message”: “Not allowed to download files of purpose: assistants”,
“type”: “invalid_request_error”,
“param”: null,
“code”: null
}
}

PaulBellow · November 13, 2023, 2:30pm

Note that the Assistants API does not currently support image inputs.

You can find more on the OpenAI GPT-4-Vision docs page…

Hope this helps.

zohaib1 · November 17, 2023, 2:31pm

Hi Guys,
I upload .docx file than i retrieve with file id both endpoints working fine but when i try retrieve file content the response i am getting is:
Note i upload file with purpose assistants

{
    "error": {
        "message": "Not allowed to download files of purpose: assistants",
        "type": "invalid_request_error",
        "param": null,
        "code": null
    }
}

Topic		Replies	Views
Unable to retrieve Assistant file content API assistants , assistants-api	4	1492	March 18, 2025
Assistant API cant read my PDF.. How come? API api	4	2447	July 20, 2024
Uploading a documentation pdf API chatgpt	7	9195	September 12, 2024
Azure OpenAI Assistants API, file upload with "vision" purpose and other issues API assistants , assistants-api , assistants-files , azure-openai	5	1779	September 7, 2024
Assistants API with file search prompt input API assistants-api	2	110	January 29, 2025

OpenAI API for image text extraction

Related topics