Thank you for your quick answer, @_j.
I agree a “small dataset” is not enough for good fine-tuning, but it should be enough to give me a glimpse of what is going on under the hood, since so little control is provided by the current API.
You mentioned “Otherwise your 100 would just be added to their 10 million”. However, I think the exact opposite happened for me: the fine-tuning used my ~100 examples to adjust the whole model (not just the last layers, for example).
Let me be more concrete. For the training/validation process I provided ~100 different examples of a same-topic conversation between two actors (their character definitions are in the system prompt). After fine-tuning (7 epochs) I tested the model. If I follow the script (screenplay), the model behaviour is decent. But if, in the middle of the conversation, the “user” actor makes an abstract segue and asks “What is the meaning of life?” (no similar sentence appears in the training dataset), the other actor’s answer is generated to follow the expected conversation route at that exact point and completely ignores the abstract question, even though there is a line in the system prompt telling it not to blindly follow the script, but to talk naturally.
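For reference, each of my training examples roughly follows the chat fine-tuning JSONL shape sketched below (the character names, prompt wording, and dialogue lines are placeholders, not my actual data):

```python
import json

# Placeholder sketch of one training example (not my real prompt or dialogue).
system_prompt = (
    "You are ACTOR_B, talking to ACTOR_A. Follow the general storyline, "
    "but do not blindly follow the script; talk naturally."
)

example = {
    "messages": [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": "ACTOR_A: So, about the plan we discussed yesterday..."},
        {"role": "assistant", "content": "ACTOR_B: Right, I think we should start with the supplies."},
    ]
}

# ~100 such conversations, one JSON object per line in the training file.
with open("train.jsonl", "w") as f:
    f.write(json.dumps(example) + "\n")
```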
Does that mean I have to include as many general and abstract questions as possible in my training/validation data to preserve the model’s ability to generalize? How does the fine-tuning override all of that general knowledge from the base, pre-trained model? And why isn’t the system prompt itself helping with this?
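Concretely, is something like the sketch below (placeholder content again) what I would have to scatter throughout the training set, so the model learns that segues are allowed and should be answered before returning to the script?

```python
# Placeholder sketch: a hypothetical off-script example I might add to the dataset.
off_script_example = {
    "messages": [
        {"role": "system", "content": "You are ACTOR_B, talking to ACTOR_A. Follow the "
                                      "general storyline, but talk naturally."},
        {"role": "user", "content": "ACTOR_A: By the way, what is the meaning of life?"},
        # The assistant acknowledges the segue instead of ignoring it, then returns to the script.
        {"role": "assistant", "content": "ACTOR_B: Ha, that's a big one. Whatever we make of it, "
                                         "I suppose. Anyway, back to our plan."},
    ]
}
```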