Hi everybody!
I am a product manager for a digitally supported real estate agency. Our aim is to provide information to ChatGPT through API and “feed” it with the following types of information:
- Knowledge base (can be provided in any format like .txt, .pdf, etc.)
- Classic data (information about our properties for example with address, can also be provided in various formats)
- Previous conversations of the users (or their background info)
It is not a knowledge base at the moment but expected to grow. We might want to upload a lof of new data in the future to it. It therefore needs to be scalable.
I saw that there is a section in Playground called Storage which gives you the ability to upload Files and Vector Stores.
I also found the API documentation about these things and think we will be able to manage using it without any issue.
However, my questions are the following:
- What exactly ist he difference between Files and Vectore Stores?
- Which one of these both is better or do you need both anyway?
- If vector stores are needed. Is there any way to let that be done by a third party? Can you suggest any you have had experience with?
- What option (files or vector stores) are cheaper? Are the files we upload counted one time as token or what would you recommend?
- Is this the right way? Or should I go for this project totally differently as it might not be scalable or so?
If you know any good guide, tutorial or similar (for a beginner/amateur) that would be highly appreciated.
Thanks for your help!