Looking for best practices for using vector database + storing metadata + chaching

dan.meier.dmm · July 5, 2023, 1:27pm

Looking for best practices for using vector database + storing metadata + caching.

I need to embed continuously new documents into my vector database and want to make them searchable (the pages) and thus want to store somewhere the metadata but still be able to scale the application and not be limited by storing metadata in the vector database (like in pinecone)

gassetteedward77 · July 5, 2023, 2:30pm

我在港口都快接不上这里的信号了，有可以用的代码帮助对接一下端口信号么！？矢量数据代码应该可以解决端口链接单元上系统出故障的日志问题！。

wfhbrian · July 5, 2023, 5:52pm

I recommend starting with a simple JSON file. This will give you an easy, flexible, and forgiving environment for experimenting and figuring out what works for you.

500MB JSON file ~= 25,000 ada embeddings

JavaScript’s JSON.stringify scales to ~500MB file with no problem. And nearly all modern computers have 500MB of RAM to spare, so you can parse the file and keep the vectors in memory for fast access.

I haven’t had reason to migrate away from this for my personal notes since the 500MB/25K embeddings limit handles my requirements and then some.

akhilshekkari21 · August 1, 2023, 6:45pm

how to convert json data to documents in langchain? Later I would be able to convert those documents into embeddings

Topic		Replies	Views
More reasons for metadata in vector store API vector-db	6	1946	May 26, 2024
Vector Database that can embed new data continuously Community vector-db	5	4016	January 24, 2025
Need advice storing structured JSON to VectorDB Community langchain , pinecone , vector-db , lost-user	4	975	July 1, 2024
I have almost 1 MB of txt file and i want to create embeddings. What is best approach to store in time optimized manner. i want to make it work for javascript API embeddings , ada , vector-db	7	1933	January 24, 2024
Best way to save html files in vector store API langchain	4	7407	October 9, 2023

Looking for best practices for using vector database + storing metadata + chaching

Related topics