Standard Dataset for Semantic Similarity

rex.vanhorn · September 8, 2023, 4:42pm

Hi, community.

I am using ADA 002 to compare generated text to standard/ideal text. It works well, but I need to justify using ADA 002 over BERT or another embedding technique. The easiest way to do this is to evaluate ADA’s performance against BERT’s, etc. on some standard set of texts.

Does a standard evaluation set exist for measuring embedding techniques’ performance on semantic similarity?

(I used translations of the Bible - e.g., NIV vs KJV, NKJV, BBE, etc. - but my thesis requires a more-standardized set for evaluation)

Thank you!

Foxalabs · September 8, 2023, 4:46pm

You can see a comparison/performance table of various embedding models here

It should be noted that different use cases find different models to be more performant, ada has a very good all around usage score and is an excellent option. It’s difficult to pick a “best”, it’s more “best for what?”

Topic		Replies	Views
Ada embedding vs SBERT API	1	4380	December 24, 2023
Ada002 performance in german compared to other embedding methodes Community gpt-4 , ada	1	1007	June 22, 2023
GPT3 vs SBERT for semantic search/similarity? API	1	1966	January 24, 2023
Embeddings Alternative to Ada-002 API embeddings , api	4	4638	November 20, 2023
Testing Ada 002 and Other Embedders with Large Texts Prompting embeddings	3	1268	December 11, 2023

Standard Dataset for Semantic Similarity

Related topics