I’m using the OpenAI embeddings service and calculating cosine similarity in Java. I am using Milvus VDB for the cosine similarity calcs and also my own calcs.
Normally, cosine values range from -1 to +1 for arbitrary points in an N-dimensional space. But I only get results ranging from 0 to 1. I’m guessing text/image/audio embeddings have certain characteristics that make cosine values restricted in the range from 0 to 1.
Does this make sense? Any help is appreciated.