Proposal for Persistent Knowledge Base with PDF Attachments in OpenAI Models

I propose that OpenAI enables developers to attach PDF files (or similar documents) to a model’s knowledge base once at setup. This would allow the model to refer to those documents continuously without requiring developers to resend portions of the documents in each API call. OpenAI could charge a base fee based on the size and number of attached PDFs, benefiting both developers and OpenAI.

Key Features:

  1. Persistent Knowledge Base: Allow models to store and reference PDFs or documents attached during setup. The model should use these documents for all future queries.
  2. Base Fee Model: Charge developers a base fee depending on the number and size of the PDFs attached. This creates a scalable revenue stream for OpenAI while offering developers greater efficiency.
  3. Increased Token Usage: By allowing this feature, developers can build applications more efficiently, which would lead to increased usage of tokens for actual queries, benefiting OpenAI.

Benefits:

  • Developers can build applications faster and more efficiently without dealing with token limitations for large documents.
  • OpenAI benefits from a new base fee revenue stream and increased API usage.
  • Users will experience improved performance and reliability when working with large knowledge bases.

Conclusion:
This approach is a win-win for both OpenAI and developers, as it speeds up application development while driving more usage of OpenAI’s API services.

1 Like

Hi @masoud1 - welcome to the Forum!

Have you had a chance yet to take a look at OpenAI’s Assistant’s API? It essentially offers exactly that capability, i.e. you can upload knowledge files to a vector store in a one-off process and then access the information in the files through its file search tool when you interact with the model.

1 Like

Hi @Jen,

Thank you!

However, thats not the solution I am looking for. With the AI Assistant, you still have to search the knowledge base and include the search results as part of the prompt to the AI model, which is not what I am proposing.

Thank you!

1 Like