Hey Open AI folks
I’m interested in building an app that pulls in a bunch of news articles, summarizes them, and cites its sources.
One problem I’m facing with the way the Assistants API is structured is that it seems to lend itself to individual larger files, per the below limit:
…a maximum of 20 files per Assistant, and they can be at most 512 MB each
Because of the file number limitation I can’t treat each article as an individual file.
Some potential workarounds I’ve thought of:
1. Grouping articles together in the same large file
This feels the most natural, but based on the API responses in the playground, it doesn’t seem like I will be able to determine the which articles were actually used for the summary, as the response only includes the file IDs use for the summary, not the line numbers
2. Creating many small assistants
This will require finding some scheme for grouping the articles in groups of at most 20. Adds a lot of technical complexity and doesn’t seem like the way assistants was intended to be used.
3. Building my own RAG
This is simply leaving OpenAI and developing my own system for this.
Let me know your thoughts! Is there a better approach I can take?