Hi All,
I am working on a chat-with-PDF application (after uploading a PDF, the user asks questions and the app returns answers).
I initially started with a simple RAG approach (steps below):
- Extract the PDF text content, convert it into embeddings (using an OpenAI embedding model), and store them in the ChromaDB vector database
- When the user asks a question, the question is converted into an embedding
- Using the question embedding, I retrieve similar text chunks from ChromaDB
- Send the user question together with the retrieved chunks as context to the OpenAI model
- The OpenAI model generates the response
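For reference, the steps above can be sketched without LangChain or LlamaIndex in plain Python. This is a minimal sketch, assuming the `openai` (v1+), `chromadb`, and `pypdf` packages are installed and `OPENAI_API_KEY` is set; the model names, chunk sizes, and function names are my own illustrative choices, not anything official. The third-party imports are deferred inside the functions so the pure chunking helper runs on its own:

```python
EMBED_MODEL = "text-embedding-3-small"  # assumed choice; any OpenAI embedding model works
CHAT_MODEL = "gpt-4o-mini"              # assumed choice of chat model

def chunk_text(text: str, size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into overlapping chunks so passages are not cut mid-thought."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + size])
        start += size - overlap
    return chunks

def build_index(pdf_path: str, collection_name: str = "pdf_chunks"):
    """Extract PDF text, embed each chunk, and store everything in ChromaDB."""
    from pypdf import PdfReader   # deferred third-party imports
    from openai import OpenAI
    import chromadb

    ai = OpenAI()
    text = "\n".join(page.extract_text() or "" for page in PdfReader(pdf_path).pages)
    chunks = chunk_text(text)
    embeddings = [
        d.embedding
        for d in ai.embeddings.create(model=EMBED_MODEL, input=chunks).data
    ]
    col = chromadb.Client().create_collection(collection_name)
    col.add(
        ids=[str(i) for i in range(len(chunks))],
        documents=chunks,
        embeddings=embeddings,
    )
    return col

def answer(question: str, col, top_k: int = 3) -> str:
    """Embed the question, retrieve similar chunks, and ask the chat model."""
    from openai import OpenAI

    ai = OpenAI()
    q_emb = ai.embeddings.create(model=EMBED_MODEL, input=[question]).data[0].embedding
    hits = col.query(query_embeddings=[q_emb], n_results=top_k)["documents"][0]
    context = "\n\n".join(hits)
    resp = ai.chat.completions.create(
        model=CHAT_MODEL,
        messages=[
            {"role": "system", "content": "Answer using only the provided context."},
            {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content
```

Usage would be something like `col = build_index("report.pdf")` followed by `print(answer("What is the summary?", col))`. The overlap in `chunk_text` is a simple guard against splitting a sentence exactly at a chunk boundary.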
This works well so far, but I am unsure how to handle cases where I want to retrieve an image along with the text when the user asks a question.
While researching this, I came across Multimodal RAG techniques.
My actual questions are:
- What about the OpenAI Assistants API? It is still in beta. Can I use it for this, and will it work with images?
- Can I continue with the RAG technique, or is there a more feasible alternative to RAG?
- If I continue with RAG, can I implement it without a framework like LangChain or LlamaIndex? Is this feasible at a beginner level?
I would appreciate any help on this.