Tweaking the Amount of Epochs

lucasheinrich95 · February 10, 2022, 12:26pm

So I’ve been experimenting with finetuning, and I’m particularly interested in how to finetune effectively on a relatively small data set. (<100 prompt completion pairs).

Is there a general rule of thumb for GPT-3 finetuning in terms of epochs. Do fewer examples mean I should run more or fewer epochs?

Likewise, I notice when a certain number of epochs is exceeded, the model just memorizes the answers verbatim. So one should rather tend to fewer epochs, right?

The default is 4 epochs, should you deviate from this with less training data?

Thanks in advance

boris · February 10, 2022, 9:02pm

Yes, that’s roughly right - thanks MA!

First of all, the bigger the model, the better it’ll perform with a small amount of examples. The best way to get the better performance is to spend time creating a few more examples, rather than optimizing hyperparameters.

Number of epochs just means how often does the model see each example - a higher the number, the “better” the memorization. For generative use cases 2 epochs is generally better, as it reduces memorization, and increases generalization. However if you have very few examples, you often can’t get away without increasing the amount of epochs, so that you can perform weight updates at least some reasonable amount of times.

So I guess my suggestion is to try davinci fine-tuning with 3 epochs or so. If you reduce the number of epochs the model largely won’t learn very well, and if you increase it too much it’ll very quickly memorize all the examples.

Topic		Replies	Views
How many Epochs for fine-tunes? API	7	15434	December 28, 2023
How does openai calculate the number of epochs in the fine-tuning api? API	5	5057	December 28, 2023
Fine-tuning doesn't seem to improve quality for me API fine-tuning	6	975	June 10, 2023
Parameters for Question & Answers samples Prompting	3	737	December 23, 2022
How do I run a fine tune for another epoch without running it for the whole thing? API fine-tuning , api	8	1184	December 23, 2023

Tweaking the Amount of Epochs

Related topics