Hi folks,
I am trying to develop a simple question and answer app using RAG technology for our own academic purpose. The RAG system I developed was very simple, following many public accessible examples using LangChain (including OpenAIEmbedding, FAISS, and ChatOpenAI). The performance was not very good. For example, when I ask “What viruses were studied in this paper?”, the app may not be able to get it right.
I also tried to directly upload the pdf file to chatGPT 4, and it was able to answer much better. When I asked GPT4, it says it used the myfiles_browser
tool to access and review the document. I wonder how I can access “myfiles_browser” through API? I have thousands of papers that I need to answer and get the answers for dozens of questions.
If “myfiles_browser” is not available through API, how can I improve my RAG app’s performance in terms of accuracy?
Thank you so much!