Order of fine tuning

SecMovPuz · January 21, 2022, 9:51pm

Recently I made a post asking about how the order of information in training effects the responses you get.

You can see that linked here: Location of information in training

In that we determined that the order does effect responses generated by gpt-3. We also determined that it is part of how gpt-3 works and there are ways to structure prompts to make this less apparent but you can’t get rid of it. Me and my partner recently started a project that requires fine tuning so we started trying to figure it out. I once again had the same question: Does the order of information in a fine tuning document effect the responses it will give and if so, how much and in what way. We are new to fine tuning and don’t fully understand it. Thanks for your help!

SecMovPuz · January 22, 2022, 3:30am

This is correct. It is how you order each example in the file.

boris · January 22, 2022, 4:07am

Correct. We also randomly shuffle the examples over and over again, so the order of examples shouldn’t make a difference.

The order of words within a single example will make a difference on the other hand.

SecMovPuz · January 23, 2022, 7:41am

Thanks, this helps alot. I don’t have to worry about ordering the data so it will understand it which has been a pain with the training, as it would “forget” training in earlier parts.

seokhyunan · November 9, 2023, 7:34am

Then, Do the API automatically and randomly sample the data from the dataset to construct a batch for each step? i.e., Do I have not to initially shuffle the dataset (that must be shuffled for normal fine tuning) before submitting .jsonl file to fine tuning API?

Thank you for your help!

Topic		Replies	Views
Order of finetuning data? API	4	783	August 16, 2024
Does the line order of the jsonl file affect fine tuning result? API api	0	542	November 9, 2023
Do the fine tuning API automatically shuffle the dataset? API api	5	928	April 10, 2024
Location of information in training Prompting	6	720	December 14, 2021
How closely does my training data need to match my prompt sequencing for Fine-tuning to be effective? API fine-tuning , training	7	1166	February 6, 2024

Order of fine tuning

Related topics