Hi all, I am working on an application, and we have encountered an issue. We initially fine-tuned the OpenAI GPT-3.5-turbo model with a specific dataset for one scenario, and it produced good results.
Later, we used that fine-tuned model as the base model and added some additional data for the same scenario with the same system prompt. The resulting fine-tuned model also worked well on the training dataset.
However, when we fine-tuned a third time, using the most recent fine-tuned snapshot, for a different scenario with a different system prompt, the resulting model performed well on the new scenario. But when I provided input related to the old scenario with the old system message, the model did not respond appropriately.
What happened to my model? I expected the latest fine-tuned model to retain knowledge of both scenarios, but the new scenario seems to have overridden the old one.
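For reference, the chain of fine-tuning jobs looked roughly like this sketch (the file IDs and snapshot names below are placeholders, not our real ones):

```python
from openai import OpenAI

client = OpenAI()

# Round 1: fine-tune the base model on scenario 1 data
# (training_file values are hypothetical placeholder file IDs)
job1 = client.fine_tuning.jobs.create(
    training_file="file-scenario1",
    model="gpt-3.5-turbo",
)

# Round 2: fine-tune the round-1 snapshot on additional scenario 1 data
job2 = client.fine_tuning.jobs.create(
    training_file="file-scenario1-extra",
    model="ft:gpt-3.5-turbo:my-org::round1",  # hypothetical round-1 snapshot name
)

# Round 3: fine-tune the round-2 snapshot on scenario 2 data only
job3 = client.fine_tuning.jobs.create(
    training_file="file-scenario2",
    model="ft:gpt-3.5-turbo:my-org::round2",  # hypothetical round-2 snapshot name
)
```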
Ok. So in your case you are trying to expand your existing fine-tuned model with an additional scenario (scenario 2). At the same time you’d like the fine-tuned model to still properly respond to scenario 1 cases.
In order for the model to recognize that it is supposed to differentiate between the two scenarios, you need to make this clear in your training data. That means your new training dataset needs to include not only examples for scenario 2 but also, again, examples for scenario 1. This way the model should be able to handle both cases.
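As a minimal sketch, the new training file could mix examples from both scenarios, each under its own system prompt (the example contents and file name here are just illustrative):

```python
import json

# Hypothetical examples -- replace with your real scenario data.
scenario1_examples = [
    {"messages": [
        {"role": "system", "content": "You are the scenario 1 assistant."},
        {"role": "user", "content": "A typical scenario 1 question"},
        {"role": "assistant", "content": "The expected scenario 1 answer"},
    ]},
]
scenario2_examples = [
    {"messages": [
        {"role": "system", "content": "You are the scenario 2 assistant."},
        {"role": "user", "content": "A typical scenario 2 question"},
        {"role": "assistant", "content": "The expected scenario 2 answer"},
    ]},
]

# Write both sets into one JSONL training file so the model keeps
# seeing scenario 1 examples while it learns scenario 2.
with open("mixed_training.jsonl", "w") as f:
    for example in scenario1_examples + scenario2_examples:
        f.write(json.dumps(example) + "\n")
```

You would then upload this mixed file and start the fine-tuning job from your latest snapshot as usual.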
I believe that if you only train the existing fine-tuned model on new cases, the new training essentially overrides the old behavior (this effect is often called catastrophic forgetting), and the model will mainly recognize the new cases.