Fine-tuning loss graph peaks

I fine-tuned a GPT model for a classification task, and I observed intermittent peaks in the loss graph. The train and validation loss didn’t move smoothly: they hovered near 0, suddenly spiked to values around 6-8 in a single step, and then dropped back to 0 in the next. There are some inaccuracies in the labeling of the training data. Could this be affecting it? If not, I am curious to know why this phenomenon is occurring.

Errors in the training data will, of course, affect the performance of the model, but if those errors are few, the impact may be acceptable.

However, errors in the evaluation set will produce odd-looking performance metrics.
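This is also consistent with the 6-8 spikes you describe. Per-example cross-entropy loss is -ln(p), where p is the probability the model assigns to the example's label, so a confident model scored against a wrong label gets a huge loss for that one example. A rough sketch (the probabilities here are illustrative, not from your run):

```python
import math

def cross_entropy(p_true_label):
    """Per-example cross-entropy: -ln(probability assigned to the label)."""
    return -math.log(p_true_label)

# A well-fit, correctly labeled example: the model assigns the
# label high probability, so the loss is near 0.
print(round(cross_entropy(0.99), 2))   # ~0.01

# A mislabeled example: the model (correctly) assigns almost no
# probability to the wrong label it is being scored against,
# so the loss for that step jumps into the 6-8 range.
print(round(cross_entropy(0.001), 2))  # ~6.91
```

A loss near 0 on most steps with isolated spikes of 6-8 is exactly what a mostly-converged model hitting the occasional bad label looks like.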

If your eval set is a subset of your training data, as is typical, then you may need to do some work to clean at least the test set. There are also other unusual aspects of how the OpenAI training system works that can produce unusual training performance graphs; I am unsure of the cause of those.
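One cheap cleaning pass is to look for prompts that appear more than once with different labels, since those are often the mislabeled ones. A minimal sketch, assuming your examples are dicts with "prompt" and "completion" keys (adjust to your actual schema):

```python
from collections import defaultdict

def conflicting_labels(examples):
    """Group examples by prompt and flag any prompt that carries
    more than one distinct label across the dataset."""
    labels = defaultdict(set)
    for ex in examples:
        labels[ex["prompt"]].add(ex["completion"])
    return {p: sorted(ls) for p, ls in labels.items() if len(ls) > 1}

data = [
    {"prompt": "great service", "completion": "positive"},
    {"prompt": "great service", "completion": "negative"},  # conflict
    {"prompt": "never again",   "completion": "negative"},
]
print(conflicting_labels(data))  # {'great service': ['negative', 'positive']}
```

Reviewing the flagged prompts by hand is usually much faster than re-checking the whole dataset.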