Embeddings advice

alexandru · July 7, 2022, 5:28pm

Hello,
I have two data sets: a rather small one of 500 units, each around 850 characters, and a larger one of 2,500 units, each around 750 characters.
I created and indexed the embeddings for both data sets using text-search-ada-doc-001, and I embed the query with text-search-ada-query-001. These are 1024-dimensional embeddings.
The results are good in both cases. However, a search on the smaller set is rather fast at 5-7 seconds, while on the larger one it takes around 15-20 seconds. I guess the time goes up linearly with the number of units, as I am computing the dot products in memory (in a Ruby on Rails app).
I would like to ask whether anyone has experience with this and could share some advice on improving it.
One option I am considering is storing the embeddings in a Milvus database, which would hopefully give better timings.
A second option would be to find a faster way of calculating the dot products for each query.
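
As a rough illustration of the second option, here is a minimal sketch in Python with NumPy (not the Ruby code from the post, which is not shown). It assumes all document embeddings are kept in memory as one float32 matrix, so every dot product for a query is computed in a single matrix-vector product instead of a per-document loop. The function name `top_k_by_dot_product` and the random data are hypothetical stand-ins.

```python
import numpy as np

def top_k_by_dot_product(doc_matrix, query_vector, k=5):
    """Return (indices, scores) of the k rows with the highest dot product."""
    doc_matrix = np.asarray(doc_matrix, dtype=np.float32)
    query_vector = np.asarray(query_vector, dtype=np.float32)
    # One matrix-vector product scores every document at once.
    scores = doc_matrix @ query_vector
    k = min(k, scores.shape[0])
    # argpartition finds the k best without sorting the whole array.
    top = np.argpartition(-scores, k - 1)[:k]
    # Sort only those k candidates by descending score.
    top = top[np.argsort(-scores[top])]
    return top, scores[top]

# Synthetic stand-in for 2,500 stored embeddings of 1024 dimensions.
rng = np.random.default_rng(0)
docs = rng.standard_normal((2500, 1024)).astype(np.float32)
docs /= np.linalg.norm(docs, axis=1, keepdims=True)  # unit length

# Use a stored vector as the query; it should rank itself first.
indices, scores = top_k_by_dot_product(docs, docs[42], k=3)
print(indices[0])
```

The same idea should carry over to Ruby with a numeric array library, as long as the vectors are loaded once and stored as a contiguous matrix rather than as nested arrays parsed on every request.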

I would also be curious to know whether, for texts of this size, I would benefit from using babbage or curie for their larger dimensions (around 2000 or 4000), or whether it is not worth the extra overhead, considering that the timings would also increase, monetary cost aside.
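
To put the dimension question in numbers, here is a small back-of-the-envelope sketch. It assumes a brute-force search, where each query needs one multiply-add per dimension per stored unit, and it assumes 2048 and 4096 for the "around 2000 or 4000" dimensions mentioned above. The helper name `multiply_adds_per_query` is hypothetical.

```python
def multiply_adds_per_query(num_units, dims):
    """Multiply-add operations for one brute-force query over all units."""
    return num_units * dims

# Dimensions for babbage and curie are assumed values, see the note above.
for name, dims in [("ada", 1024), ("babbage", 2048), ("curie", 4096)]:
    print(name, multiply_adds_per_query(2500, dims))
# ada 2560000
# babbage 5120000
# curie 10240000
```

The cost grows linearly with both the number of units and the dimension, so going from 1024 to 4096 dimensions would roughly quadruple the arithmetic. Even so, a few million multiply-adds is a small workload for optimized numeric code, which suggests (as a guess, not a measurement) that timings of several seconds come mostly from something other than the dot products themselves, such as how the vectors are loaded or represented.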

daveshapautomator · July 7, 2022, 5:57pm

There’s also FAISS which is open source. I’ll be experimenting with FAISS soon.

alexandru · July 7, 2022, 6:28pm

I have read a bit about it but don’t know the exact flow. I will keep an eye on your YouTube channel.

daveshapautomator · July 7, 2022, 9:05pm

Okay I wrote a thing that should help. It is not optimized but it works!

joyasree78 · May 5, 2023, 7:58pm

The home page says "file not found" and does not show the GitHub.
