ChatGPT API with 200 massive PDF files

I have 200 PDF files vectorized and I am using the ChatGPT API with the Retrieval tool. My question is: what is the best way to reduce hallucinations and get better precision in the answers?

Welcome to the community!

Your options with retrieval are pretty limited, especially if you don’t want to edit and clean your documents.

In terms of using RAG in general, there’s vigorous debate on the matter. You can join the fray here:


Hi @s.rodriguez , if those 200 PDF files can be segmented by topic, closely related keywords, or any other characteristic on which they could be clustered, I would suggest considering a routing / classification layer before RAG. I have personally used this approach multiple times for larger knowledge bases and it worked out quite well. Hope that was of some help.

Vasyl, Thank you very much for the suggestion.
Our agent, built in Python with the OpenAI API, processes the 200 vectorized PDF files through the “Retrieval” tool offered by the OpenAI API. Currently, we use two internal assistants before generating a response to any chat query about the PDF documents. We could add a third assistant to handle classification and route each query to the specific PDF(s) related to the searched topic. We will try it. If you have any other ideas on how to perform the routing, they would be welcome. Thank you!


@vasyl and @s.rodriguez

Hi guys,

And thank you very much for sharing that information.

I am new to using OpenAI and Make, and I am struggling to get things together, especially when it comes to combining big files into a usable memory.

Would you mind elaborating on your answer, please? Even sharing a picture of a Make scenario, or anything that would help me understand your reasoning!

Please please please.

Here is the strategy described above, laid out more concretely, which one might use to subcategorize domains of expertise, since the agentic use of two assistants wasn’t fully explained.

```mermaid
graph TD
    A[User Query] --> B[Preprocessing, Custom Prompting]
    B --> C[AI 3: Query Classification & Routing]
    C -->|Domain Identified| D1[Assistant 1: General Vector Store]
    C -->|Domain Identified| D2[Assistant 2: Specialized Vector Store]
    D1 --> E[Knowledge Search]
    D2 --> E
    E --> F[Contextual Ranking & Integration]
    F --> G[Response Generation]
    G --> H[Assistant Response Delivery]
```
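The flow in the diagram can be sketched as plain Python to show how the pieces connect. Every step here is a stand-in stub (all function names and return values are hypothetical); in a real system the classify, search, and generate steps would be calls to your assistants via the OpenAI API.

```python
# Sketch of the pipeline from the diagram above. All steps are stubs
# (hypothetical); in a real system they would call the OpenAI
# Assistants / Retrieval API for classification, search, and generation.

def preprocess(query: str) -> str:
    """Step B: normalize the query and apply custom prompting."""
    return query.strip()

def classify(query: str) -> str:
    """Step C (AI 3): pick a vector store for the query. Stubbed."""
    return "specialized" if "pdf" in query.lower() else "general"

def knowledge_search(query: str, store: str) -> list[str]:
    """Steps D/E: search the chosen vector store. Stubbed."""
    return [f"[{store}] chunk matching: {query}"]

def rank(chunks: list[str]) -> list[str]:
    """Step F: contextual ranking and integration. Stubbed (identity)."""
    return chunks

def generate(query: str, context: list[str]) -> str:
    """Steps G/H: compose and deliver the final answer. Stubbed."""
    return f"Answer to '{query}' using {len(context)} context chunk(s)."

def answer(user_query: str) -> str:
    q = preprocess(user_query)
    store = classify(q)
    chunks = rank(knowledge_search(q, store))
    return generate(q, chunks)
```

The point of the structure is that each box in the diagram maps to one replaceable function, so you can swap the keyword stub in `classify` for a real classifier assistant without touching the rest of the pipeline.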