How to implement the similar feature like chat with uploaded file feature in ChatGPT

0xffmeta0 · February 19, 2025, 1:25pm

In ChatGPT, we can uploade pdf/images to the chat window, and then ask following questions. I’m just wondering how to implemnet a similar feature in my own chat bot.
I know with the assistant + thread api, we could achieve the feature, but the cost might be high.
So not sure how ChatGPT implement the feautre? It’s also using assistant api?

0xffmeta0 · February 20, 2025, 12:28pm

Does ChatGPT also convert the pdf to text in the background? And when sends the message history to api, it’s actually converted into text already.

pretendlake · February 20, 2025, 8:55pm

You’re on the right direction, but it works a bit differently, let me explain:

The pdf gets split into small text chunks (like a line from a paragraph)
Each chunk is converted to an embedding and saved in a vector database along with the text chunk

To better understand the purpose of embeddings and how to use them, the documentation explains it very well:
https://platform.openai.com/docs/guides/embeddings

When a user sends a message, this message is also converted to an embedding
The vector database is queried to find embeddings which are the closest in distance to the user message embedding (meaning the vector is semantically similar to the user message)
The text chunks from the most similar embeddings are passed as context to the GPT as a system prompt

TLDR: No, the whole pdf is not being sent as context on every message. Instead, by making use of embeddings, we identify the parts of the pdf that are related to the message and pass only those parts as context.

This technique is also known as RAG (Retrieval Augmented Generation)

Topic		Replies	Views
Replicating ChatGPT's behavior of attaching a document using OpenAI API API chatgpt , api	3	695	July 22, 2024
Answering questions about text file content API	5	9047	December 15, 2023
What is the API equivalent of uploading a PDF? API gpt-4o	1	4931	June 20, 2024
I want to upload pdf file in chatgpt and ask for a summary of it in one go? API	3	2187	June 10, 2024
Seeking Advice: Uploading Large PDFs for Analysis with GPT-3 API API gpt-35-turbo , chatgpt , fine-tuning , api	7	7090	December 13, 2023

How to implement the similar feature like chat with uploaded file feature in ChatGPT

Related topics