How the file chat process works

EliasBrazil · December 5, 2023, 11:10pm

Hello everybody.
I’m creating a chatbot for internal use that will respond based on PDF files converted to text.
However, these files are very large and I cannot pass all the content via API as it would consume many tokens and even be impossible in some cases.

I would like suggestions and some clarification on the best way to obtain the most relevant text to send as context via API.

Do I necessarily need to use a vector database?
Any tools to extract the most relevant text as context?
Any other tips will be very important, as I am a first-timer.

supershaneski · December 5, 2023, 11:30pm

The least effort way you can do this right now is via the Assistants API. You can start now by creating your own Assistant, select a model to use (GPT-4-1106-preview, etc), add Instruction(system prompt), upload your PDF file (maximum file size is 512 MB), turn on the Retrieval from the Tools and start chatting. Just pay attention to the tokens you are consuming. You can select 3.5 models if you just want to get the hang of things before you try GPT-4 models. Also, files uploaded for Retrieval is free until 1/12 next year and Code Interpreter is free to use until 12/13 this month.

EliasBrazil · December 5, 2023, 11:50pm

Thanks for your response. This seems very promising although the issue of token consumption seems like a problem, as I didn’t see how the developer could control it, I saw some here in the community reporting very high spending using the Assistant.

SomebodySysop · December 6, 2023, 3:35am

Still the best way to do it: https://youtu.be/Ix9WIZpArm0?si=tKIb0RzffnU-3UPe

Topic		Replies	Views
Answering questions about text file content API	5	8629	December 15, 2023
Best Practice to save money on Calling Assistant API API gpt-4 , api	3	941	November 24, 2023
How to read large files using OPENAI API? API gpt-35-turbo , api	7	8110	December 17, 2023
Creating a bot using 100+ PDFS as the knowledge base API	19	12875	August 15, 2024
Seeking Advice: Uploading Large PDFs for Analysis with GPT-3 API API gpt-35-turbo , chatgpt , fine-tuning , api	7	6855	December 13, 2023

How the file chat process works

Related topics