I intend to fine-tune a model so that it is aware of its previous mistakes. For example:
#### Example 1
User: what's 2x2?
AI: 4.
User: Are you sure?
AI: Absolutely.
#### Example 2
User: what's 2x2?
AI: 5.
User: Are you sure?
AI: Sorry, I meant 4.
I’d like to know whether the AI's first message is treated as frozen from the optimization point of view, i.e., excluded from the loss. Otherwise, with this approach, I would be teaching the model to say 5 in Example 2 when it knows the answer is 4.
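To be concrete about what I mean by "frozen": I'm thinking of per-token label masking, where masked tokens contribute nothing to the loss. Here is a minimal sketch (a hypothetical helper, not any specific library's API), assuming the common convention that a label of -100 excludes a token from the loss:

```python
IGNORE_INDEX = -100  # conventional "ignore this token in the loss" label


def build_labels(turns, train_only_last_assistant=True):
    """Build per-token labels for a conversation given as (role, token_ids)
    pairs. Tokens we don't want to learn from are set to IGNORE_INDEX.

    With train_only_last_assistant=True, earlier assistant turns (like the
    mistaken "5" in Example 2) are frozen; only the final assistant turn
    (the correction) is trained on.
    """
    labels = []
    last_assistant = max(
        (i for i, (role, _) in enumerate(turns) if role == "assistant"),
        default=None,
    )
    for i, (role, tokens) in enumerate(turns):
        trainable = role == "assistant" and (
            not train_only_last_assistant or i == last_assistant
        )
        labels.extend(tokens if trainable else [IGNORE_INDEX] * len(tokens))
    return labels
```

Under this scheme, the first AI turn in Example 2 would be masked out, so the model is never directly trained to emit the wrong answer. My question is whether the fine-tuning setup actually does this, or whether all assistant turns are trained on.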