Why cosine_similarity between embedding vectors is always above .68

It’s been discussed a bunch, here is one example I found:

Some people normalize it. What I do instead is adjust my thresholds. Usually anything above 0.9 is correlated, anything below 0.8 is uncorrelated, and anything between 0.8 and 0.9 is the grey zone.

But these are rough values, and you should adjust from here given your observations on your own data set.
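
For what it's worth, here is a minimal sketch of both approaches in Python. The cutoffs are just the rough values above, and the `FLOOR` of 0.68 is taken from the thread title. The `rescale` function is only one guess at what "normalizing" could mean here (linearly stretching the observed range onto 0 to 1). All function and constant names are made up for illustration.

```python
import math

# Rough cutoffs from the post above; assumptions to tune on your own data.
LOW = 0.8
HIGH = 0.9
# Observed lower bound from the thread title; an assumption, not a guarantee.
FLOOR = 0.68


def cosine_similarity(a, b):
    """Plain cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def classify(score, low=LOW, high=HIGH):
    """Bucket a raw similarity score using the adjusted thresholds."""
    if score > high:
        return "correlated"
    if score < low:
        return "uncorrelated"
    return "grey zone"


def rescale(score, floor=FLOOR):
    """Linearly map [floor, 1] onto [0, 1]; scores below floor clamp to 0."""
    return max(0.0, (score - floor) / (1.0 - floor))
```

If you go the rescaling route, remember that the thresholds have to be rescaled too, so in practice it gives the same decisions as adjusting the thresholds directly.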
