💬 Training an embedding adapter: adapt embeddings to a new context and boost RAG performance

Hi all,

I’ve put together a simple package that trains an adapter matrix to fine-tune your embeddings for a new context. This includes embeddings from OpenAI’s embedding models.

You’ll need embeddings for query-document pairs, plus a label indicating whether each document is relevant to its query.
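
For illustration, here’s a minimal sketch of what those inputs might look like, assuming NumPy arrays with one row per query-document pair (the array names, sizes, and the 1536-dimension figure are just placeholders, not anything the package requires):

import numpy as np

# Placeholder data: 1,000 query-document pairs with 1536-dimensional
# embeddings (the dimension used by several of OpenAI's embedding models).
query_embeddings = np.random.rand(1000, 1536)     # embedding of each query
document_embeddings = np.random.rand(1000, 1536)  # embedding of the paired document
labels = np.random.randint(0, 2, size=1000)       # 1 = relevant, 0 = not relevant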

The idea is to offer a simple, familiar fit/transform API:

from embedding_adapter import EmbeddingAdapter

adapter = EmbeddingAdapter()
adapter.fit(query_embeddings, document_embeddings, labels)  # learn the adapter matrix
adapted = adapter.transform(new_embeddings)                 # apply it to new embeddings
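
Once fitted, the adapted embeddings can be dropped into an ordinary cosine-similarity retrieval step. A rough sketch, assuming transform() returns a NumPy array, that the same adapter is applied to both the corpus and the query, and that query_embedding is a single raw query embedding as a 1-D array (the ranking code below is my own illustration, not part of the package):

import numpy as np

# Hypothetical retrieval step: adapt the corpus and the query,
# then rank documents by cosine similarity in the adapted space.
adapted_docs = adapter.transform(document_embeddings)
adapted_query = adapter.transform(query_embedding.reshape(1, -1)).ravel()

norms = np.linalg.norm(adapted_docs, axis=1) * np.linalg.norm(adapted_query)
scores = adapted_docs @ adapted_query / norms   # cosine similarity per document
top_k = np.argsort(scores)[::-1][:5]            # indices of the 5 closest documents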

More details in the repo’s README.

Any feedback would be most appreciated 🙂

This is awesome! I’d been considering training a small adapter layer after reading a tweet about the idea, and while searching for that tweet and related research I came across your project.

Do you have any data on how this compares with plain cosine similarity on the unadapted embeddings, or links to research supporting this technique?