I’m fine-tuning a model and I don’t know exactly how to interpret the results.
If an early point on the loss curve has a much lower validation loss than a later point, does that mean the earlier checkpoint is actually better?
Or was it just evaluated on an easy subset of the validation set?
(example: compare the highlighted timestep vs. the final timestep)
Maybe the validation loss is calculated locally, on just one batch at a time …
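To illustrate why a per-batch validation loss can look much lower than the true full-set loss, here is a minimal sketch. The numbers are hypothetical (I'm assuming a mix of easy and hard validation examples, and a 32-example eval batch); it just shows how much the reported loss can swing depending on which examples a given eval step happens to draw.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical per-example validation losses for a 4,500-example set:
# mostly easy examples (low loss) plus a minority of hard ones.
per_example_loss = np.concatenate([
    rng.normal(0.5, 0.1, 4000),   # easy examples
    rng.normal(3.0, 0.5, 500),    # hard examples
])

# The "true" validation loss averages over the whole set.
full_val_loss = per_example_loss.mean()

# If each eval step only samples a small batch, the reported loss
# depends heavily on which examples it happened to draw.
subset_losses = [
    rng.choice(per_example_loss, size=32, replace=False).mean()
    for _ in range(10)
]

print(f"full-set val loss:  {full_val_loss:.3f}")
print(f"32-example batches: min={min(subset_losses):.3f}, "
      f"max={max(subset_losses):.3f}")
```

A batch that happens to contain few hard examples will report a loss well below the full-set average, so a single low point on the curve isn’t strong evidence that the checkpoint is better.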
In the past, I have gotten weird loss curves like this while training on 500,000 tokens and 4,500 examples: the curve goes to zero, then has a weird bump at the end.
Overall the model appears to be fine, even with this odd curve. I base this on comparing models trained with the new fine-tuning system against the old system on the same underlying training data.