Offline Embedding Options

I’m seeking advice from the community on any options they might be aware of for generating embeddings without needing to call a cloud service. This is for Vectra, my local vector DB project, and is related to a question I got from a user. It looks like TensorFlow might be an option, but I’m wondering if there are other options, and if anyone in the community can comment on the quality of the TensorFlow embeddings for semantic search compared to either OpenAI’s or HF’s embeddings?

Offline embeddings are interesting not only as a cost-savings measure but also for search over private data, where you don’t want any data leaking to an external cloud.

I saw this posted a while back:


The issue you will hit, though, is that it requires a reasonable amount of compute. Hosting models is also not trivial: you need monitoring, an API, etc.


Look at this:


Considering the current cost of embeddings, I think the depreciation on the hardware you’d run local ones on would be greater than the cost incurred by using the API.

Agreed. Especially in the cloud, it’s way more expensive to use an EC2 instance with a GPU. :smiling_face_with_tear:

If I am using an embedding approach like this:

what am I doing here? Am I hitting the OpenAI API or not? Because in the code I have not specified any endpoint, just the model.

Can someone please explain?

Yes, you will be hitting the ada endpoint when you embed if you specify text-embedding-ada-002.
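To make the "no endpoint in my code" part concrete: the client library fills the URL in for you from the model name. Here is a rough sketch of what happens under the hood, using only the standard library (the real SDK adds retries, error types, etc.; an `OPENAI_API_KEY` environment variable is assumed to hold your key):

```python
# Sketch of the HTTP request the OpenAI client makes for you. The model
# name is just a field in the request body; the endpoint URL is supplied
# by the library, which is why you never see it in your own code.
import json
import os
import urllib.request

def embed(text: str) -> list:
    req = urllib.request.Request(
        "https://api.openai.com/v1/embeddings",  # the implicit endpoint
        data=json.dumps(
            {"model": "text-embedding-ada-002", "input": text}
        ).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ.get('OPENAI_API_KEY', '')}",
        },
    )
    with urllib.request.urlopen(req) as resp:
        # ada-002 returns a 1536-dimensional float vector
        return json.load(resp)["data"][0]["embedding"]
```

So yes: every call goes over the network to OpenAI; nothing runs locally except tokenization helpers.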

Suppose I have question-answer pairs as text data, and now I want to convert them to embeddings like this:

embedding_model = "text-embedding-ada-002"
embedding_encoding = "cl100k_base"

What am I doing here? If I am using cl100k_base, does that mean I am hitting the ada endpoint to convert the text data into embeddings?

And one other question: can I save the embedded data in SQL Server or not?

And how would the query search work? Because my embedded data has the answers in vector form, but the question is in text.
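On the query-search part of the question above: at query time you embed the incoming question with the same model you used for the answers, then rank the stored vectors by cosine similarity. A minimal sketch of that pattern, using SQLite as a stand-in for SQL Server and tiny made-up 3-d vectors in place of real 1536-d ada-002 embeddings (all names here are illustrative):

```python
# Store each answer's embedding as JSON text in an ordinary SQL column,
# then scan and rank by cosine similarity in application code.
import json
import math
import sqlite3

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE answers (text TEXT, embedding TEXT)")

# Toy 3-d vectors standing in for real embeddings from the same model.
rows = [
    ("Paris is the capital of France.", [0.9, 0.1, 0.0]),
    ("Mitochondria are the powerhouse of the cell.", [0.0, 0.2, 0.9]),
]
for text, vec in rows:
    conn.execute("INSERT INTO answers VALUES (?, ?)", (text, json.dumps(vec)))

def search(question_embedding, top_k=1):
    # Embed the question first (not shown), then compare against every row.
    scored = [
        (cosine_similarity(question_embedding, json.loads(emb)), text)
        for text, emb in conn.execute("SELECT text, embedding FROM answers")
    ]
    return sorted(scored, reverse=True)[:top_k]

# Pretend this is the embedding of "What is the capital of France?"
best = search([0.8, 0.2, 0.1])
```

A full table scan like this is fine for small datasets; past a few hundred thousand rows you would want a vector index (which is what Vectra and dedicated vector DBs provide).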

Try Sentence Transformers.

You might want to start with one of the many pretrained models, e.g. all-MiniLM-L6-v2, which is lightweight (just ~80 MB), fast, and yields good results.
