Does weight at 0 reduce training tokens?

Does anyone know if using “weight”: 0 in the training file for fine-tuning reduces the number of training tokens? I imagine it does, since the model doesn’t train on that message.

Can someone confirm?

For reference, there is a new parameter that can be passed on a particular message within an example conversation in a chat-completions training file:

To skip fine-tuning on specific assistant messages, a weight key can be added to disable fine-tuning on that message, allowing you to control which assistant messages are learned. The allowed values for weight are currently 0 or 1.
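For context, here is what a training line using that key can look like. This is just an illustrative sketch: the conversation content and the `train.jsonl` file name are made up, and only the `messages`/`weight` structure reflects the documented format.

```python
import json

# One training example in the chat-completions fine-tuning format.
# "weight" goes on assistant messages, with allowed values 0 or 1.
example = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is the capital of France?"},
        # Kept in the conversation, but per the docs weight 0 means
        # this assistant message is not fine-tuned on.
        {"role": "assistant", "content": "I think it might be Lyon.", "weight": 0},
        {"role": "user", "content": "No, try again."},
        # weight 1 (or omitting the key) means the message is learned.
        {"role": "assistant", "content": "The capital of France is Paris.", "weight": 1},
    ]
}

# Each training example is one line of the JSONL file.
with open("train.jsonl", "a") as f:
    f.write(json.dumps(example) + "\n")
```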

You are right that the method employed here is unclear, as is why a true “skip” would have any value beyond just removing the message yourself, if the AI model were never passed those tokens for training. The only thing a 0-weight might do is break up the training so that messages are learned more individually, instead of as a sequence of runs that starts with or continues into another message - which would still require the presence of tokens that have no impact.

I suspect that such a mechanism could have future or internal (special partner) use as a float value, effectively passing a per-message learning rate. It would not remove the tokens or reduce the token count, only scale the algorithmic impression those tokens make on learning - which is what “weight” implies by definition.
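OpenAI hasn’t documented the internals, but a per-token loss mask of the kind described above would look roughly like this. Purely a sketch with made-up tensors, not a claim about their implementation:

```python
import torch
import torch.nn.functional as F

def masked_lm_loss(logits, targets, token_weights):
    """Cross-entropy over a token sequence, scaled per token by token_weights.

    logits:        (seq_len, vocab_size) model outputs
    targets:       (seq_len,) next-token labels
    token_weights: (seq_len,) 0.0 for masked tokens, 1.0 otherwise
                   (or any float, in the hypothetical generalisation)
    """
    per_token = F.cross_entropy(logits, targets, reduction="none")
    weighted = per_token * token_weights
    # Normalise by the weight mass so masked tokens don't dilute the loss.
    return weighted.sum() / token_weights.sum().clamp(min=1.0)

# Toy example: 6 tokens, vocabulary of 10. The middle two tokens belong to a
# weight-0 assistant message, so they stay in the sequence (and in the context
# the model sees) but contribute nothing to the gradient.
logits = torch.randn(6, 10, requires_grad=True)
targets = torch.randint(0, 10, (6,))
weights = torch.tensor([1.0, 1.0, 0.0, 0.0, 1.0, 1.0])
loss = masked_lm_loss(logits, targets, weights)
loss.backward()
```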


Thanks for your answer, but what I was testing is whether the number of training tokens reported once the fine-tuning of the model has finished goes down when using “weight”: 0.
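For what it’s worth, the count I’m comparing is the one reported on the finished job. With the openai Python client that can be read like this (the job ID below is a placeholder):

```python
from openai import OpenAI

client = OpenAI()

# Retrieve a finished fine-tuning job and inspect the token count it reports.
job = client.fine_tuning.jobs.retrieve("ftjob-abc123")  # placeholder job ID
print(job.status, job.trained_tokens)
```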

And about why to use “weight”: 0: basically, if you work with RAG, where the system message changes for each user message, each model response has a different context, so you only want to train on the response that matches the context included in that example.
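A sketch of the kind of training line I mean (placeholder content and file name): the system message holds the context retrieved for the latest question, earlier assistant turns are kept only as conversation history with weight 0, and the final response, the one produced with that context, gets weight 1.

```python
import json

# RAG-style example: the system message carries the context retrieved for the
# *latest* user question, so earlier assistant replies (generated against
# different retrieved context) are kept as history but masked with weight 0.
example = {
    "messages": [
        {"role": "system", "content": "Context: <documents retrieved for question 2>"},
        {"role": "user", "content": "Question 1"},
        {"role": "assistant", "content": "Answer 1 (based on other context)", "weight": 0},
        {"role": "user", "content": "Question 2"},
        {"role": "assistant", "content": "Answer 2 (based on this context)", "weight": 1},
    ]
}

with open("rag_train.jsonl", "a") as f:
    f.write(json.dumps(example) + "\n")
```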