Over fitted model unable to reproduce training set

rong_txyz · August 14, 2024, 1:21am

We finetuned the data using jsonl with system/user/assistant, and the training loss is approaching 0.
Yet, when I selected an example from training set and pair with the same system message, changing temperature to 0 and TopP to 0.5, I am still unable to reproduce the training set

jr.2509 · August 14, 2024, 7:16am

Welcome to the Forum!

Can you share details about your fine-tuning use case, i.e. what did you try to achieve with the fine-tuning?

rong_txyz · August 29, 2024, 3:01am

I am using the system/user/assistant format, where each time the user would ask a question and the assistant would give an answer,usually the question is 10-20 token and answer in the length 50 tokens

dignity_for_all · August 29, 2024, 3:55am

Consider splitting a portion of your fine-tuning dataset to use as a validation set during the fine-tuning process.

Comparing the loss rate with the data not used in fine-tuning can help verify reproducibility.

Topic		Replies	Views
Training loss=good, Validation loss=good API fine-tuning , api , fine-tuning-problems	8	4512	April 5, 2024
Fine tuned with wrong data initially API fine-tuning-problems	11	1460	December 23, 2023
Fine tuned model doesn't perform on production environment API chatgpt	6	704	August 29, 2023
Generating unwanted answers in Fine-tuning API fine-tuning , fine-tuning-problems	2	54	November 6, 2024
Fine tuned model produces responses that make it seem like it hasn't been fine tuned at all API fine-tuning , fine-tuning-problems	1	1514	September 14, 2023

Over fitted model unable to reproduce training set

Related topics