RAG document training via Open AI

razvan.i.savin · September 30, 2024, 3:00pm

From your description, here’s what I gather:

Current RAG Integration:
- Training Documents: You have a set of documents that you’ve trained your RAG (Retrieval-Augmented Generation) system on, likely by creating embeddings.
- Vector Store: These documents are stored in a vector store, which the RAG system queries to generate responses.
- Shared Responses: All users interact with the chatbot, and responses are fetched from this shared set of trained documents.
New Use Case Requirements:
- User-Specific Documents: Allow individual users to upload their own documents.
- Temporary Validity: The uploaded documents should only be accessible to the uploading user and only for the duration of their current session.
- Session Isolation: Once the session ends, the uploaded document should no longer influence the chatbot’s responses for that user.

Based on your draft, here are a few points to clarify:

Vector Store with Files and RAG:
- Are you using a specific vector store library (e.g., FAISS, Pinecone) to manage your embeddings?
- How are you currently organizing and querying these embeddings within your RAG setup?
Training Documents:
- When you mention “training different documents,” are you referring to creating embeddings for each document?
- Do you preprocess these documents (e.g., tokenization, cleaning) before embedding them?
Embeddings vs. Plain Text:
- Do you create embeddings each time a document is uploaded during a session?
- Are the costs associated with creating embeddings justified, especially if the file is updated and you need to recreate embeddings?
- Do you generate and store embeddings for the uploaded documents during the session, or are you using the plain text directly for retrieval?
- How do you manage the lifecycle of these embeddings (creation, storage, deletion) tied to user sessions?

Topic		Replies	Views
Is it possible to train a model from my own private documents? API plugin-development , api , large-language-model , training	5	7858	May 24, 2024
Implementing a file upload in my application using open ai api API gpt-4 , chatgpt , plugin-development , api , chatgpt-plugin	7	7669	January 25, 2024
Replicating ChatGPT's behavior of attaching a document using OpenAI API API chatgpt , api	3	645	July 22, 2024
RAG or Fine tuning for a domain specific QA chatbot API rag , development , chatbot , assistants-api	4	1485	July 3, 2024
Strategy to Train Model using R.A.G API training	5	2396	October 18, 2023