How to achieve ChatGPT-level PDF parsing with APIs?

sanjeevthakur · August 27, 2024, 5:09am

ChatGPT takes pdf and parses it just fine. How to accomplish at least that level of accuracy using the APIs?

Munna23 · August 27, 2024, 8:43am

@sanjeevthakur - You could use the Assistants API with its file search capabilities, or code a tool that performs Retrieval-Augmented Generation (RAG) each time you need context. The latter approach is often better because it gives you more control over retrieval and chunking strategies. Hope this helps - Cheers!

sps · August 27, 2024, 12:38pm

Welcome @sanjeevthakur

The general idea is to make sure that the model gets all the info that is present on a page in a pdf. Simply extracting text + supplying and image of the page suffices.

Here’s a tutorial from the OpenAI Cookbook that deals exactly with parsing PDFs for RAG.

Topic		Replies	Views
ChatGPT with multiple PDFs giving Gibberish with real Data API	1	1157	January 23, 2024
AI tool to take pdf as input GPT builders chatgpt	3	2170	December 27, 2023
What is the best way to parse a PDF file with ChatGPT? API	9	38511	November 16, 2024
GPT-4 API for Educational Application API gpt-4 , chatgpt	1	1260	December 25, 2023
What is the API equivalent of uploading a PDF? API gpt-4o	1	2763	June 20, 2024

How to achieve ChatGPT-level PDF parsing with APIs?

Related topics