I’ve read about embeddings, and from what I’ve seen the direction is always “text =(model)=> vectors”. Is it possible to feed vectors into the model’s prompt, in order to save some tokens?
IMHO it looks like what fine-tuning does; however, fine-tuning seems to be an offline operation, while I would like to provide the vectors during a live prompt.
Is that possible, in practice or even in theory?
There’s just one problem: GPT can’t do the required math.
Do you mean the LLM itself, 3.5-turbo or gpt-4? Those models are trained on tokens as input; I’m not sure what value a vector would be to the model. It speaks tokenese, and you are proposing a conversation in vectorish.
The vectors are also quite big, usually. For example, ada-002 embeddings are 1536 floats. Supposing each float costs at least 4 tokens when written out as text, that is over 6000 tokens. Not much compression.
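You can sanity-check that estimate with a rough back-of-the-envelope calculation. This sketch serializes a random 1536-dimensional vector to text and applies the common heuristic of roughly 4 characters per token (the random vector and the 8-decimal formatting are assumptions for illustration; a real tokenizer such as tiktoken would give an exact count):

```python
import random

random.seed(0)

EMBED_DIM = 1536  # ada-002 embedding dimension

# Stand-in for an embedding vector: 1536 floats in [-1, 1].
vec = [random.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]

# Serialize the way you would have to paste it into a prompt.
text = ", ".join(f"{v:.8f}" for v in vec)

# Rough heuristic: ~4 characters per token.
approx_tokens = len(text) // 4

print(f"{len(text)} characters, ~{approx_tokens} tokens")
```

Even with this crude estimate the serialized vector lands in the thousands of tokens, which is the point: pasting raw floats into a prompt saves nothing.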
Besides using embeddings in the traditional sense, you could take the embedding vectors and use them as input to your own neural network. A simple feed-forward network is a good place to start: the input is the vector of 1536 floats (or whatever your embedding dimension is), followed by however many hidden layers, ending with your final output layer. So if you build a binary classifier, your output layer has dimension 2.
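As a minimal sketch of that idea, here is a forward pass of such a classifier in pure Python: 1536 inputs, one hidden layer, a 2-way softmax output. The layer sizes, random weights, and random “embedding” are all placeholder assumptions; in practice you would train the weights (e.g. with PyTorch) on real embedding vectors:

```python
import math
import random

random.seed(0)
EMBED_DIM, HIDDEN, CLASSES = 1536, 64, 2  # assumed sizes for illustration

def feed_forward(x, w1, b1, w2, b2):
    """One hidden layer (ReLU) followed by a softmax output layer."""
    hidden = [max(0.0, sum(xi * wi for xi, wi in zip(x, row)) + b)
              for row, b in zip(w1, b1)]
    logits = [sum(hi * wi for hi, wi in zip(hidden, row)) + b
              for row, b in zip(w2, b2)]
    # Numerically stable softmax over the logits.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Untrained random weights, just to show the shapes involved.
w1 = [[random.gauss(0, 0.01) for _ in range(EMBED_DIM)] for _ in range(HIDDEN)]
b1 = [0.0] * HIDDEN
w2 = [[random.gauss(0, 0.01) for _ in range(HIDDEN)] for _ in range(CLASSES)]
b2 = [0.0] * CLASSES

# Stand-in for an ada-002 embedding vector.
embedding = [random.gauss(0, 1) for _ in range(EMBED_DIM)]
probs = feed_forward(embedding, w1, b1, w2, b2)
print(probs)  # two class probabilities summing to 1
```

This is the sense in which the vectors are useful downstream: they are features for your own model, not something the chat model can read in a prompt.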