Contextualized Embedding: Get GPT3 Embeddings of Each Token in a Sentence

siliconlife · March 25, 2023, 10:56pm

Say, I want to get the contextualized embedding of the word love in the sentence I love her. How can I do it via the GPT3 embedding API?

Assuming there is a sentence of T tokens and the model output dimension is D, then in BERT or GPT2 model, we can get the T x D embeddings so the corresponding embeddings of the targeted word’s token(s) are the contextualized embedding. But the GPT3 Embedding’s API only give 1 x D embedding of every input. Therefore, it seems that there is no way to get the contextualized embedding of a specific word in a sentence?

gladjoy · July 5, 2023, 8:02am

Exactly my question. Also BERT has vocab.txt which is a list of all tokens, what’s the counterpart in openai embedding?

Topic		Replies	Views
Extracting each word's embeddings from embedded sentence API embeddings	3	224	August 28, 2024
Embeddings for tokens used by GPT models? API	2	824	December 17, 2023
Is it possible to get "context aware" embeddings? API embeddings	9	1129	August 31, 2024
Does openAI provide API that takes Embeddings as an input? API embeddings	10	3413	December 18, 2023
Text embeddings vs word embeddings API embeddings	1	2238	September 4, 2023

Contextualized Embedding: Get GPT3 Embeddings of Each Token in a Sentence

Related topics