You could try piecing together theories by looking at open-source transformer models, or at models you've trained yourself.
But since the latest GPT models are all private, who besides OpenAI's engineers knows?
The API doesn't expose the internal buffers and arrays of these private models. It's a black box.
But for fun, you could embed each token, build a vector library covering the whole vocabulary, and then spin up your own model on top of that token-to-vector mapping.
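A minimal sketch of that token-to-vector idea, with some loud assumptions: the corpus, vocabulary, and embedding dimension below are made up for illustration, and the vectors start out random rather than trained — a real model would learn them.

```python
import numpy as np

# Hypothetical toy corpus -- a stand-in for a real tokenizer's output.
corpus = "the cat sat on the mat the dog sat on the rug".split()
vocab = sorted(set(corpus))
tok_to_id = {tok: i for i, tok in enumerate(vocab)}

# The "vector library": one embedding row per token in the vocabulary.
# These are random here; training would replace them with learned values.
dim = 8
rng = np.random.default_rng(0)
embeddings = rng.normal(size=(len(vocab), dim))

def embed(token):
    """Map a token to its vector -- the token-to-vector mapping described above."""
    return embeddings[tok_to_id[token]]

def cosine(a, b):
    """Cosine similarity, a crude way to compare two token vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

print(embed("cat").shape)  # (8,)
print(cosine(embed("cat"), embed("dog")))
```

From there, "spinning your own model" would mean feeding these vectors into whatever architecture you like and training the embeddings along with it.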