What does the "weight" parameter do when fine-tuning

harperg · June 18, 2024, 5:38pm

I see in the docs that you can assign a “weight” to your assistant messages to “skip fine-tuning” on specific messages.

Can anyone elaborate on what this does exactly? I am looking to use it to give my bot the ability to recover from its own mistakes. Is this the intended use?

For example, if you had the following convorsation:

User: What is the capital of the united states

Assistant: Los angeles, (weight=0)

User: It is?

Assistant: Sorry, I got mixed up there. The capital is Washington.

Will this work? Or will it train the gpt to answer incorrectly?

Topic		Replies	Views
Unable to get "weight" field to work Bugs fine-tuning , fine-tuning-problems	5	261	June 20, 2024
Weight at 0 reduce training tokens? API fine-tuning , api , pricing	2	320	April 19, 2024
Do the fine tuned models bake in anything from system/user? API fine-tuning	0	34	August 7, 2024
GPT Fine-tune. Need to fine tune a model that uses is references or dictionary API fine-tuning	4	501	January 28, 2024
Does fine-tuning freeze past messages of the AI? API	1	300	August 31, 2023

What does the "weight" parameter do when fine-tuning

User: What is the capital of the united states

Assistant: Los angeles, (weight=0)

User: It is?

Assistant: Sorry, I got mixed up there. The capital is Washington.

Related topics