Vector Database that can embed new data continuously

abdullah.mujahid · August 8, 2023, 4:59am

Hello everyone,

I’m new to the field of AI and I’m currently working on creating a Chatbot tailored to engage with customers using personalized information.

Moreover, I’m interested in continuously augmenting my chatbot’s knowledge base to ensure it remains up-to-date. I’ve come across the term “semantic search” which may do it or maybe not, but I’m uncertain about its implementation and the approach.

I’m seeking advice and guidance from the community regarding the best approach to address this challenge. Are there alternative vector databases apart from Pinecone that you would recommend? Pinecone appears to be beyond my budget at the moment.

I appreciate your assistance and insights. Thank you.

Foxalabs · August 8, 2023, 8:55am

Hi and welcome to the forum!

Personally I use ChromaDB, there are many others if you search for open source vector database.

If you append all of the text from the user and from the LLM into the vector database you should have a vector searchable addition to the context, how you distribute that data and how you incorporate the searches back into your future prompts will be what defines your application from other “MemoryGPT” systems.

abdullah.mujahid · August 8, 2023, 10:26am

I don’t fully understand, can you shed a bit more light on this?

Let me give you an example scenario. I am building an application where I have 4 documents to provide info about and I train my system for those 4 documents, in future, I add another document, and I don’t want to re-train my model.

Foxalabs · August 8, 2023, 10:42am

For example this product : https://memorygpt.io

kartik2 · September 18, 2023, 11:21am

Hello!
There is a product called ‘SuperDuperDB’, recently revield to the market
github - SuperDuperDB/superduperdb

Continuos integration of database with vector database is one of the key capability!
Please have a look!

Thanks

sakshamgoel · January 24, 2025, 8:16pm

Great Question!
Pathway fully supports what you’re describing. Its architecture allows for recomputations and updating only the parts of your data source affected by changes or new additions/deletions. This ensures that RAG chatbots operate efficiently at scale while always leveraging the most up-to-date knowledge.

I’d be happy to connect and share more details about how these underlying recomputations and updates.

Topic		Replies	Views
Vector Store Database recommendations for chat app (not assistant?) API	5	361	July 5, 2024
Vector embedding notes and chat history API embeddings , chat-completion , vector-db	4	2176	June 6, 2024
Response speed with semantic searching API	2	1133	December 29, 2023
Daily Updation of Knowledgebase API chatgpt , api , vector-db , knowledge-files , vector-store	1	31	January 24, 2025
Storing embeddings in SQL Server? Latency between Redis & Pinecone? Vector DB recommendations? API	18	7426	December 23, 2023

Vector Database that can embed new data continuously

Related topics