So a few seconds, 10 seconds? What ballpark is it?
And this is for your 3.5k embeddings, right? Or was it 4.5M embeddings?
This may be of interest:
They do offer some datasets for quick testing:
The link above kinda shows why I haven’t even bothered to measure.
Yes, 3.5K objects containing several text fields, each using embeddings of 1536 dimensions per vector.
Wow, some impressive numbers!
I can see the downside for folks like me: since I have sparse traffic, the hosting costs would eat me alive.
The tech is cool though. I have looked into FAISS as an algorithm, but the naive argmax works just fine for me (for now)
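For reference, the naive argmax approach is just one matrix-vector product followed by a max, something like this minimal numpy sketch (function and variable names are only illustrative, not from any actual codebase):

```python
import numpy as np

def naive_argmax_search(query_vec: np.ndarray, embeddings: np.ndarray) -> int:
    """Return the index of the best-matching stored embedding."""
    # embeddings: (N, 1536) matrix of stored vectors, query_vec: (1536,).
    # With unit-normalized embeddings, the dot product equals cosine
    # similarity, so the largest score is the nearest neighbor.
    scores = embeddings @ query_vec
    return int(np.argmax(scores))
```

At 3.5K vectors of 1536 dimensions that full scan is only a few million multiply-adds, which is part of why there’s nothing worth timing.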
But will have to keep Weaviate in mind for sure
That’s why I’m using their cloud services…
In my case, I can run 400,000 embeddings at 1–2 seconds of latency for less than $1 per month, assuming the system has settled post-cold start, no elaborate database backups, and sparse traffic.
Here my major cost is backups, oddly enough.
High volumes of traffic might drive me to Weaviate. At that point it might be close on cost, but I’m pretty sure I’d have to ditch naive argmax to get anywhere close to Weaviate on latency!
I would have to ditch multiplies and go with the Manhattan metric, and code it efficiently (probably vectorized over the entire batch of embeddings at once). That might give me a fighting chance.
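A rough sketch of what that could look like, vectorized in numpy over the whole batch (assuming the same hypothetical (N, 1536) embeddings matrix as above; nothing here is benchmarked):

```python
import numpy as np

def manhattan_argmin(query_vec: np.ndarray, embeddings: np.ndarray) -> int:
    """Return the index of the stored embedding closest to the query in L1 distance."""
    # Broadcasting computes |e_i - q| for every row at once, so the hot path
    # is subtractions and absolute values instead of multiplies.
    dists = np.abs(embeddings - query_vec).sum(axis=1)
    return int(np.argmin(dists))
```

Both versions still scan every stored vector; the only difference is the per-element operation inside that scan.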