Some questions about text-embedding-ada-002’s embedding

Thanks for pointing this massive error out!

I updated the code above with this line:

```python
U = U[D:, :]  # "All But The Top": keep everything except the top D components
```
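For anyone landing here later, this is roughly what the full post-processing looks like in context. It is a minimal self-contained sketch, not my production code; the function name `all_but_the_top` and the variable `X` (embeddings stacked row-wise) are my own. Subtracting the projection onto the top D components, as below, is equivalent to projecting onto the remaining components like the line above does:

```python
import numpy as np

def all_but_the_top(X: np.ndarray, D: int = 15) -> np.ndarray:
    """Post-processing from Mu & Viswanath (2018): center the embeddings,
    then remove their projection onto the top D principal components."""
    mu = X.mean(axis=0)                  # common mean vector
    Xc = X - mu                          # center the embeddings
    # Rows of Vt are the principal directions, sorted by singular value
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    top = Vt[:D, :]                      # the top D directions to remove
    return Xc - Xc @ top.T @ top         # subtract the projection onto them
```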

It is working much better now! The cosine similarities are noticeably more spread out.

But now I’m wondering how many dimensions I should really drop. D is set to 15 right now, which roughly matches the D ≈ d/100 heuristic from the All-But-The-Top paper (ada-002 embeddings have d = 1536 dimensions), but anyone who uses this should tune D against their own data. One way to examine it is sketched below.
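To get a feel for how many components dominate, one option is to look at the spectrum directly. A sketch, assuming `X` is the raw embedding matrix from above:

```python
import numpy as np

Xc = X - X.mean(axis=0)                  # center first, as in ABTT
_, S, _ = np.linalg.svd(Xc, full_matrices=False)
var_ratio = S**2 / np.sum(S**2)          # variance explained per component

# See how much of the total variance the top D components soak up
for D in (1, 5, 10, 15, 20, 30):
    print(f"top {D:>2} components: {var_ratio[:D].sum():.1%} of total variance")
```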

PS: I am not using this in production yet, so I have very little insight so far. But the variance is higher and the cosine similarities still make sense when the top components are dropped (ABTT). A quick way to sanity-check that is below.
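Here is the kind of quick check I mean, reusing the hypothetical `all_but_the_top` helper sketched earlier; a larger standard deviation of pairwise cosine similarities is the "more spread" I was referring to:

```python
import numpy as np

def pairwise_cosine_std(X: np.ndarray, sample: int = 500) -> float:
    """Std of pairwise cosine similarities over a random sample of rows."""
    rng = np.random.default_rng(0)
    idx = rng.choice(len(X), size=min(sample, len(X)), replace=False)
    V = X[idx] / np.linalg.norm(X[idx], axis=1, keepdims=True)
    sims = V @ V.T                                   # cosine similarity matrix
    return sims[np.triu_indices_from(sims, k=1)].std()

print("before:", pairwise_cosine_std(X))
print("after: ", pairwise_cosine_std(all_but_the_top(X, D=15)))
```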