Storing embeddings in SQL Server? Latency between Redis & Pinecone? Vector DB recommendations?

raymonddavey · February 9, 2023, 9:20pm

We ended up creating our own engine.

Do you need all the embeddings for every query. Some of our clients break their embeddings into categories, and use a different database for each area

Eg different areas of law, different topics within a University etc

10 billion embeddings is a lot. If each one is 100 tokens long, are you encoding 1 trillion words/tokens? is that correct?

Out of interest, how long is the text you are embedding for each entry? You may be able to combine entries in some way to have longer blocks of text, and therefore less embeddings - But it will depend on your use case.

Topic		Replies	Views
Using Redis for embeddings API	21	12929	December 23, 2023
OpenAI Embedding vector database API	5	2982	December 23, 2023
Which database tools suit for storing embeddings generated by the Embedding endpoint? API	46	25109	December 13, 2023
Best architecture for searching historical emails semantically? API	25	4464	August 22, 2024
Embedding and searching from similar embeddings API	6	6250	October 27, 2023

Storing embeddings in SQL Server? Latency between Redis & Pinecone? Vector DB recommendations?

Related topics