Ideally this benchmark is similar to SWE-Bench where any RAG provider is free to put their system to the test and compete for a spot on the leader board.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
RAG is failing when the number of documents increase | 35 | 17685 | December 17, 2024 | |
RAG Evolution with Reasoning Models | 10 | 218 | April 30, 2025 | |
Scaling RAG chatbot system to millions of documents | 18 | 5952 | February 28, 2024 | |
How can RAG systems be improved for more complex queries | 3 | 3536 | October 31, 2023 | |
We've been building the open source ultimate RAG backend and are launching our V2 | 9 | 2242 | January 5, 2025 |