Overcoming many small files using Assistants Retrieval

evankozliner · November 26, 2023, 5:22pm

Hey Open AI folks

I’m interested in building an app that pulls in a bunch of news articles, summarizes them, and cites its sources.

One problem I’m facing with the way the Assistants API is structured is that it seems to lend itself to individual larger files, per the below limit:

…a maximum of 20 files per Assistant, and they can be at most 512 MB each

Because of the file number limitation I can’t treat each article as an individual file.

Some potential workarounds I’ve thought of:

1. Grouping articles together in the same large file
This feels the most natural, but based on the API responses in the playground, it doesn’t seem like I will be able to determine the which articles were actually used for the summary, as the response only includes the file IDs use for the summary, not the line numbers

2. Creating many small assistants
This will require finding some scheme for grouping the articles in groups of at most 20. Adds a lot of technical complexity and doesn’t seem like the way assistants was intended to be used.

3. Building my own RAG
This is simply leaving OpenAI and developing my own system for this.

Let me know your thoughts! Is there a better approach I can take?

anon10827405 · November 26, 2023, 5:38pm

You’ll need to use some sort of retrieval system to determine which assistants to use, which defeats the purpose.

This in my opinion is the best option. The current retrieval system is incredibly lacking and not usable in production. One powerful feature in RAG that retrieval lacks is hybrid search which combines semantic search alongside keywords.

You can use Weaviate and find much more control, much better results, and for much cheaper.

evankozliner · November 26, 2023, 5:41pm

Yeah I’ve been playing around with Neum / weaviate as well but saw this offering and thought it might save me time. All good thanks for the response!

Topic		Replies	Views
RAG with more than 10 files API assistants-api	9	4674	January 15, 2024
Assistants seems to struggle citing multiple sources with Retrieval API assistants	7	1092	November 26, 2023
New "Assistants" API a potential replacement for low level "RAG" style content generation? API	9	8604	March 4, 2024
Did assistant api kill manual RAG with vector databases? API	8	6732	December 18, 2023
The 20 File Limit on assistants is not useful for large Retrieval-Augmented Generation API assistants	5	3649	November 24, 2023

Overcoming many small files using Assistants Retrieval

Related topics