OpenAI's embeddings are a black box, and not much documentation is available. I ran a test comparing Cohere and OpenAI embeddings on the three texts below, and found that Cohere gives me better control over the similarity score. It would be great to hear other opinions on this; maybe I am not using the OpenAI embeddings correctly. The three texts are:
Romwe Women's Plus Size Short Sleeve Surplice Deep V Belted Ruched Mini Party Bodycon Dress
95% Polyester, 5% Spandex
Tie closure
High stretchy material with good softness, comfortable to wear
The party bodycon dress feature with wrap v neck, batwing sleeve, self tie waist and ruched detail
Good choise for party, cocktail, evening, prom, nightout, club and work
The elastic material hugs your figure perfectly and the bodycon cut creates a seductive silhouette
Please refer to the size measurement in image before ordering
Romwe Women's Plus Size Casual Drawstring Twist Front Cut Out V Neck Short Sleeve Summer Sexy Bodycon Dress
100% Polyester
Pull On closure
High stretchy, soft and comfortable
Cut out, drawstring, v neck, high waist mini dresss for women
Good choise for party, cocktail, club, date, work, holiday, casual and formal wear
Keep this formal dress with high heels and additional jewellery for a chic look
Please refer to the size measurement in image before ordering
COOFANDY Men's Muscle Fit Button Down Dress Shirt Long Sleeve
50% Cotton, 48% Polyester, 2% Spandex
Imported
Button closure
Machine Wash
【Wrinkle-Free】 High quality woven fabric, lightweight and breathable, wrinkle free dress shirts with a clean look, keeps your body dry and comfortable all day.
【Soft Cotton Fabric】This long sleeve shirts are light and comfortable to wear. Elastic fabric fits perfectly on all body type and allows greater mobility in any direction with no restriction, making you enjoy activewear levels of comfort and mobility.
【Fashionable Design】Male dress shirts always come in a variety of types. Classic solid/plaid one never goes wrong. Slim fit stylish dress shirt with classic turndown collar, button up closure, long sleeve and metal contrast buttons makes you more handsome and attractive.
【Occasions】You can pair this long sleeve button down shirts with chinos/jeans for casual daily wear, or match the stretchable shirt with dress pants for classy look. This smart shirt is essential in mens wardrobe and greats for all season, Suitable for office, business, date, night out, club, travel and casual daily wear.
【Garment Care】Machine washable. ❤The fabric of this plaid dress shirt differs from one with solid color, which is more elastic. Please refer following size chart in the product description to choose best fit for you
I then embedded them with both Cohere and OpenAI and stored the vectors in Supabase. When I run a cosine similarity against the question "List slim fit long sleeved shirts for men", I get the results below:
OpenAI embedding
1st Text - 0.874352702871908
2nd Text - 0.874352702871908
3rd Text - 0.866211994130881
Cohere embedding
1st Text - 0.753884081524262
2nd Text - 0.762143687765075
3rd Text - 0.822315609664169
I have a threshold of 0.79, so with Cohere only the 3rd text (the men's shirt) is retrieved, which is the right result. With OpenAI, all three texts score above the threshold.
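To make the comparison concrete, here is a minimal Python sketch of the threshold check applied to the scores reported above. The `cosine_similarity`, `retrieved`, and `spread` helpers are hypothetical names of my own, not part of the OpenAI, Cohere, or Supabase APIs, and the scores are copied by hand from this post rather than recomputed from real embeddings.

```python
import math

def cosine_similarity(a, b):
    """Plain cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

THRESHOLD = 0.79  # the cut-off used in this post

# Scores copied from the results above (not recomputed here).
scores = {
    "OpenAI": {
        "1st Text": 0.874352702871908,
        "2nd Text": 0.874352702871908,
        "3rd Text": 0.866211994130881,
    },
    "Cohere": {
        "1st Text": 0.753884081524262,
        "2nd Text": 0.762143687765075,
        "3rd Text": 0.822315609664169,
    },
}

def retrieved(model_scores, threshold=THRESHOLD):
    """Return the texts whose score meets the threshold."""
    return [name for name, s in model_scores.items() if s >= threshold]

def spread(model_scores):
    """Gap between the highest and lowest score for one provider."""
    return max(model_scores.values()) - min(model_scores.values())

for provider, model_scores in scores.items():
    print(provider, retrieved(model_scores), round(spread(model_scores), 4))
```

With these numbers, the fixed 0.79 threshold lets all three texts through for OpenAI and only the 3rd text for Cohere. The OpenAI scores also sit much closer together, so a threshold tuned on one provider's scores probably cannot be reused for the other. Note that in the OpenAI results as reported, the 3rd text also has the lowest score of the three, so raising the threshold alone would not select it.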