Efficiently Interacting with Super Long PDFs/Documents

Hey everyone, I’m building a simple project for myself and wanted to know: what’s the best way to efficiently interact with very long PDFs (2000+ pages)? I want to extract information reliably and have a chatbot interface for easy querying. Any tips or approaches for making this simple and effective? Should I go the classic route of LangChain + a vector DB, or maybe try building a custom GPT for this? Accuracy really matters to me given how long these documents are, so I’d love your thoughts on how you’d do it. Thanks for your help!


Did you end up implementing this? If so, what approach did you go with?

When it comes to handling PDF interactions, the best approach depends on your specific goals and how much complexity you’re willing to take on. Here are two effective methods to consider:

1. Easy Implementation with the File Search Feature

For a straightforward solution, you can use the file search feature available in the Assistants API. It lets you stand up a working PDF search capability quickly, and it’s ideal if you just need a simple, efficient way to query documents.
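For reference, here’s a minimal sketch using the OpenAI Python SDK’s beta Assistants endpoints. The file name, model, and prompts are placeholders, and since these endpoints are in beta the exact surface may have changed by the time you read this:

```python
from openai import OpenAI

client = OpenAI()

# Create a vector store and upload the long PDF into it.
vector_store = client.beta.vector_stores.create(name="long-pdf")
with open("manual.pdf", "rb") as f:  # placeholder file name
    client.beta.vector_stores.file_batches.upload_and_poll(
        vector_store_id=vector_store.id, files=[f]
    )

# Create an assistant with file_search enabled, pointed at the store.
assistant = client.beta.assistants.create(
    model="gpt-4o",
    instructions="Answer questions using only the attached PDF.",
    tools=[{"type": "file_search"}],
    tool_resources={"file_search": {"vector_store_ids": [vector_store.id]}},
)

# Ask a question in a fresh thread and print the reply.
thread = client.beta.threads.create(
    messages=[{"role": "user", "content": "What does section 4 say about warranty terms?"}]
)
run = client.beta.threads.runs.create_and_poll(
    thread_id=thread.id, assistant_id=assistant.id
)
messages = client.beta.threads.messages.list(thread_id=thread.id)
print(messages.data[0].content[0].text.value)
```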

2. Custom Tool for Enhanced Functionality

If you’re looking for a more tailored approach, consider creating a custom tool that gets invoked whenever a user asks a question about the PDF. This method gives you more control, enabling you to:

  • Develop a Custom Splitting Strategy: Define how your PDF content is divided into chunks for more accurate retrieval.
  • Implement Custom Retrievers: Use retrieval methods suited to your data (keyword, embedding-based, or hybrid) to fetch the passages most relevant to the user’s query.

At the end of the day, you should be passing back to the LLM only the subset of information relevant to the user’s query (see the sketch below). Check out other posts on the forum where people have discussed context-based splitting.
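As a rough illustration, here’s a minimal sketch of that flow: a naive character-window splitter, embedding-based retrieval, and a final call that passes only the top-scoring chunks to the model. The file name, model names, chunk sizes, and the `pypdf` dependency are all assumptions, and for a real 2000-page document you’d want a proper vector DB rather than in-memory numpy:

```python
import numpy as np
from openai import OpenAI
from pypdf import PdfReader  # assumed PDF text extractor

client = OpenAI()

def split_pdf(path: str, chunk_chars: int = 2000, overlap: int = 200) -> list[str]:
    """Naive character-window splitter; swap in a context-aware strategy for real use."""
    text = "\n".join(page.extract_text() or "" for page in PdfReader(path).pages)
    step = chunk_chars - overlap
    return [text[i : i + chunk_chars] for i in range(0, len(text), step)]

def embed(texts: list[str], batch: int = 100) -> np.ndarray:
    """Embed texts in batches to stay under per-request limits."""
    vecs = []
    for i in range(0, len(texts), batch):
        resp = client.embeddings.create(
            model="text-embedding-3-small", input=texts[i : i + batch]
        )
        vecs.extend(d.embedding for d in resp.data)
    return np.array(vecs)

# Index once up front (placeholder file name).
chunks = split_pdf("manual.pdf")
chunk_vecs = embed(chunks)

def retrieve(query: str, k: int = 5) -> list[str]:
    """Return the k chunks most similar to the query by cosine similarity."""
    q = embed([query])[0]
    scores = chunk_vecs @ q / (np.linalg.norm(chunk_vecs, axis=1) * np.linalg.norm(q))
    return [chunks[i] for i in np.argsort(scores)[::-1][:k]]

# Pass only the relevant subset back to the LLM.
question = "What does the warranty cover?"
context = "\n---\n".join(retrieve(question))
answer = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": f"Answer using only this context:\n\n{context}"},
        {"role": "user", "content": question},
    ],
)
print(answer.choices[0].message.content)
```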

Combine your custom tool with the thread-management capabilities of the Assistants API and it should do the job. Hope this helps. Cheers :slight_smile: