“Chat over documents” is generally handled with retrieval-augmented generation: split the document into chunks, create an embedding for each chunk, and store them in a vector database. Then, on a user query, create an embedding of the query, run a similarity search against the vector database to retrieve the most relevant chunks, and put those chunks into the prompt. There are plenty of tutorials on YouTube and elsewhere on how to do this. LangChain is probably the easiest way to get a proof-of-concept running (though I wouldn’t recommend it for a large-scale production app).
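
For concreteness, here’s a minimal sketch of that flow in plain Python rather than LangChain, using the OpenAI client with an in-memory list standing in for a real vector database. The model names, file path, and naive chunking are just placeholders to show the shape of the pipeline, not a production setup:

```python
# Minimal RAG sketch: chunk -> embed -> retrieve by similarity -> prompt.
# Assumes the `openai` and `numpy` packages and an OPENAI_API_KEY in the env.
import numpy as np
from openai import OpenAI

client = OpenAI()

def chunk(text: str, size: int = 500) -> list[str]:
    # Naive fixed-size chunking; real pipelines usually split on
    # sentence/paragraph boundaries with some overlap.
    return [text[i:i + size] for i in range(0, len(text), size)]

def embed(texts: list[str]) -> np.ndarray:
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

# "Index" the document: chunk it and embed the chunks.
document = open("my_document.txt").read()  # hypothetical file
chunks = chunk(document)
chunk_vectors = embed(chunks)

# At query time: embed the query and find the most similar chunks
# (cosine similarity over the in-memory "index").
query = "What does the document say about pricing?"
q = embed([query])[0]
scores = chunk_vectors @ q / (np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(q))
top_chunks = [chunks[i] for i in np.argsort(scores)[::-1][:3]]

# Stuff the retrieved chunks into the prompt and ask the model.
prompt = (
    "Answer using only the context below.\n\nContext:\n"
    + "\n---\n".join(top_chunks)
    + f"\n\nQuestion: {query}"
)
answer = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": prompt}],
)
print(answer.choices[0].message.content)
```

A real vector database (pgvector, Pinecone, Weaviate, etc.) replaces the in-memory array once you have more chunks than fit comfortably in memory, but the retrieval logic stays the same.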