NEFTune: adding noise to vector embeddings

jon.pfirman · March 2, 2024, 6:32pm

I recently came upon a fine tuning technique that involves adding noise to embedding vectors during training. Results have been interesting on llama with reports of efficiency jumps from 29.79% to 64.69% with the inclusion of noisy embeddings.

I know hugging face allows you to pass neftune_alpha as a parameter when fine tuning and I was wondering if there was a way to do that with openAI?

N2U · March 2, 2024, 11:28pm

OpenAI does not use neftune as a parameter, but you can use learning_rate_multiplier to achieve the same goal. More information over here:

https://platform.openai.com/docs/api-reference/fine-tuning/create#fine-tuning-create-hyperparameters

_j · March 3, 2024, 3:32am

Link to paper instead of micropayment blog site:

We hypothesize that by adding noise to the embeddings at train time, the model overfits less to the specifics of the instruction-tuning dataset, such as formatting details, exact wording, and text length. Instead of collapsing to the exact instruction distribution, the model is more capable of providing answers that incorporate knowledge and behaviors of the pretrained base model.

Topic		Replies	Views
What does fine-tuning do? API fine-tuning	5	1821	February 7, 2024
Is it possible to fine tune the embedding model? API	20	20078	March 29, 2024
Fine tuning: what is it good for? Community fine-tuning	5	11632	October 12, 2023
Embeddings vs finetunes API	7	2888	January 16, 2023
How to correctly fine tune my own model? API	3	2659	January 21, 2023

NEFTune: adding noise to vector embeddings

Related topics