Yeah @edmund, I saw the same weird training-loss (TL) curve when fine-tuning a binary classifier, in this thread over here:
My training file had 4000 examples, and the system chose 3 epochs for that amount of data. So with only 3 epochs I don’t think I was overfitting, and every example was distinct (no repeated tokens going in).
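For anyone curious how I saw which epoch count got picked, here’s a minimal sketch of pulling the job’s hyperparameters, assuming the legacy openai-python (<1.0) fine-tuning API; the job ID is a placeholder:

```python
# Minimal sketch: inspect which epoch count the system picked for the fine-tune.
# Assumes the legacy openai-python (<1.0) API; "ft-XXXXXXXX" is a placeholder job ID.
import openai

job = openai.FineTune.retrieve("ft-XXXXXXXX")
print(job["hyperparams"])  # e.g. shows the n_epochs chosen for the 4000-example file
```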
When I get some time, I’m going to run this model side by side with the old Babbage fine-tune and check for any discrepancies or degradation in performance (the old model used 4 epochs and the same training data).
But initial spot-checks show the new “overfit” model is performing correctly. Just need more data to be confident.
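In case it helps, here’s roughly the spot-check I have in mind, a minimal sketch assuming the legacy openai-python (<1.0) Completions API; the model names and the train.jsonl path are placeholders:

```python
# Minimal sketch: run the same labelled prompts through both fine-tunes and count
# where they disagree. Assumes the legacy openai-python (<1.0) Completions API;
# model names and the JSONL path are placeholders.
import json
import openai

OLD_MODEL = "babbage:ft-personal-old"  # placeholder: 4-epoch Babbage fine-tune
NEW_MODEL = "babbage:ft-personal-new"  # placeholder: 3-epoch "overfit" fine-tune

def classify(model: str, prompt: str) -> str:
    """Return the single completion token, i.e. the predicted class label."""
    resp = openai.Completion.create(
        model=model,
        prompt=prompt,
        max_tokens=1,    # binary classifier: one label token is enough
        temperature=0,
        logprobs=2,      # keep label probabilities around for closer inspection
    )
    return resp["choices"][0]["text"].strip()

disagreements = 0
with open("train.jsonl") as f:  # the shared training file, reused here as a spot-check set
    for line in f:
        ex = json.loads(line)
        old_label = classify(OLD_MODEL, ex["prompt"])
        new_label = classify(NEW_MODEL, ex["prompt"])
        if old_label != new_label:
            disagreements += 1
            print("disagree:", ex["prompt"][:60], old_label, "vs", new_label)

print("total disagreements:", disagreements)
```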
Still, the TL curve going to 0 is disturbing!