Validation loss vs. full validation loss

The new fine-tuning API has some features I haven’t fully explored that simplify the advice I’ve previously handed out about continuing a fine-tune in small steps.

Specifically: the n_epochs hyperparameter specifies how many passes will be made through your training data. There are now checkpoints: individual models produced at the end of each epoch, so you can find the point where the model becomes overfitted (which, at the learning rate shown in your graph, happened quite early).
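
As a minimal sketch of setting n_epochs when creating a job (assuming the current openai Python SDK; the model name and file IDs below are placeholders for your own):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# "file-abc123" / "file-def456" are hypothetical IDs for your
# previously uploaded training and validation files.
job = client.fine_tuning.jobs.create(
    model="gpt-3.5-turbo",
    training_file="file-abc123",
    validation_file="file-def456",
    hyperparameters={"n_epochs": 3},  # passes through the training data
)
print(job.id)  # keep this; you'll need it to list the job's checkpoints
```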

https://platform.openai.com/docs/guides/fine-tuning/use-a-checkpointed-model

This is also the source of the “full validation loss” report: a metric computed against your whole validation file at each checkpoint, giving more insight into model quality than the per-step validation loss sampled during batching.
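
A rough sketch of pulling those per-checkpoint metrics (the job ID is a placeholder, and the metric field names are as I understand them from the docs):

```python
from openai import OpenAI

client = OpenAI()

# "ftjob-abc123" is a hypothetical job ID; substitute your own.
checkpoints = client.fine_tuning.jobs.checkpoints.list("ftjob-abc123")

for ckpt in checkpoints.data:
    print(
        ckpt.step_number,
        ckpt.fine_tuned_model_checkpoint,  # model name usable for inference
        ckpt.metrics.full_valid_loss,      # loss over the whole validation set
    )
```

Each checkpoint’s fine_tuned_model_checkpoint name can be passed as the model for inference, so you can compare the epochs against each other directly.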

You can read more at the link above; I would only be reading it myself and distilling it, and I’d have to fill in the unclear sections with experimentation I haven’t done.
