OpenAI just announced the Text and Code Embeddings endpoint:
We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, clustering, topic modeling, and classification. Embeddings are numerical representations of concepts converted to number sequences, which make it easy for computers to understand the relationships between those concepts. Our embeddings outperform top models in 3 standard benchmarks, including a 20% relative improvement in code search.
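To make the "semantic search" use case concrete, here's a minimal sketch of how you might call the embeddings REST endpoint and rank documents by cosine similarity to a query. The model name and the exact request shape are assumptions for illustration, not taken from the announcement; check the current API docs before using this.

```python
# Sketch: fetch embeddings from the OpenAI /v1/embeddings endpoint and rank
# documents by cosine similarity to a query. The model name below is a
# placeholder assumption; substitute whatever embedding model is available.
import os
import requests
import numpy as np

API_URL = "https://api.openai.com/v1/embeddings"
HEADERS = {"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"}
MODEL = "text-embedding-ada-002"  # placeholder model name

def embed(texts):
    """Return one embedding vector (numpy row) per input string."""
    resp = requests.post(API_URL, headers=HEADERS,
                         json={"model": MODEL, "input": texts})
    resp.raise_for_status()
    return np.array([item["embedding"] for item in resp.json()["data"]])

docs = [
    "How to reset a forgotten password",
    "Quarterly revenue grew 12% year over year",
    "Steps for configuring two-factor authentication",
]
query = "I can't log in to my account"

doc_vecs = embed(docs)
query_vec = embed([query])[0]

# Embeddings map text to vectors where nearby vectors mean related concepts,
# so cosine similarity gives a relevance ranking for the query.
scores = doc_vecs @ query_vec / (
    np.linalg.norm(doc_vecs, axis=1) * np.linalg.norm(query_vec)
)
for score, doc in sorted(zip(scores, docs), reverse=True):
    print(f"{score:.3f}  {doc}")
```

The same pattern extends to the other listed tasks: feed the vectors to a clustering or classification algorithm instead of ranking them by similarity.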
Thank you. Arvind took the high road - nice. Nils' claims were just too silly. Not that GPT-3 is perfect or the best - I certainly don't know - but I get annoyed when I see things like "1 million times more expensive." That's why I don't make time for twitter.
@lmccallum his criticisms are actually quite valid and well grounded, especially wrt cost trade-offs. The nuance here is that the standard benchmarks he mentions may not correlate well with real-world dataset performance - which is what the OpenAI embeddings seem to be optimized for. Regardless, every practitioner should have their own in-house benchmarks for making these judgments.