After fine-tuning my base model for a new scenario,new scenario might have overridden the old one scenario?

Hi all, I am working on an application, and we have encountered an issue. We initially fine-tuned the OpenAI GPT-3.5-turbo model with a specific dataset for one scenario, and it produced good results.

Later, we used the fine-tuned model as the base model and added some additional data for the same scenario with the same system prompt. The resulting fine-tuned model also worked well with the training dataset.

However, when we fine-tuned the model a third time using the most recent fine-tuned snapshot for a different scenario with a different system prompt, the output model performed well for the new scenario. But when I provided input related to the old scenario with old system message, the model did not respond appropriately.

What happened to my model? It seems that the recently fine-tuned model should have knowledge of both scenarios, but the new scenario might have overridden the old one.

Can anyone help me find a solution?

Welcome to the Forum!

I just responded to a very similar question here:

Not 100% sure but it is my best guess.

Thank you for your quick response. could you provide some additional information.

1 Like

What specifically is unclear?

is the recent fine tunned model differentiate the two scenarios or not, my model does not identify my old scenario input

Ok. So in your case you are trying to expand your existing fine-tuned model with an additional scenario (scenario 2). At the same time you’d like the fine-tuned model to still properly respond to scenario 1 cases.

In order for the model to recognize that it is supposed to differentiate between two different scenarios, you need to make this clear in your training data. That means, in your new training data set you need to not only include examples for scenario 2 but also again examples for scenario 1. This way the model should be able to handle both cases.

I believe that if you only train the existing fine-tuned model with new cases, then it essentially overrides the existing training and the model will only recognize the new cases.

1 Like

I will try to include both scenarios in my training examples at a time . Thanks a lot, buddy!

2 Likes

Let us know how it goes. These case studies are definitely interesting and also helpful for other Forum users. Good luck!

2 Likes