Fine-tuning reducing randomness

Hi all 🙂
Tried a couple of configurations with fine-tuning on Curie. In general, really cool results, but for some prompts it just gets stuck repeating the prompt. I increased the temperature to the max, but to no avail.
Also, I tried to avoid overfitting the data (I don’t have a lot of it), so I just tuned on 50 examples, one epoch, with a 0.025 learning rate. Still stuck with repeating. Anyone got a way out?
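For reference, the job setup roughly corresponds to something like this with the legacy openai-python (<1.0) SDK (just a sketch; the key and file ID are placeholders, and I’m assuming the 0.025 maps to learning_rate_multiplier):

```python
# Sketch only: the fine-tune job described above, via the legacy
# openai-python (<1.0) FineTune endpoint. Key and file ID are placeholders.
import openai

openai.api_key = "sk-..."  # placeholder API key

job = openai.FineTune.create(
    training_file="file-abc123",     # placeholder: ID of the uploaded 50-example JSONL
    model="curie",
    n_epochs=1,                      # single pass to limit overfitting
    learning_rate_multiplier=0.025,  # assuming this is the 0.025 "learning rate" above
)
print(job["id"])
```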

Thanks a lot!

Not the prompt, but I’ve sometimes noticed the response gets stuck on repeat. I suspect it’s because I used a synthetic dataset for fine-tuning and didn’t include enough variety when making it.

It could also be your separator keyword. How are you ending your prompts? I do this:

[[original message]]

DEMARK WORD:

I do the training very consistently with two newlines followed by a very distinctive word that denotes the end of the input. Make sure you use the same exact format at inference time.
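As a rough sketch (the separator and helper names here are just illustrative), the point is to build the training examples and the inference prompt from the same ending:

```python
# Illustrative sketch: keep the prompt ending identical in training and at inference.
# "DEMARK WORD:" stands in for whatever distinctive marker you pick.
import json

SEPARATOR = "\n\nDEMARK WORD:"  # two newlines + a distinctive end-of-input marker

def make_training_example(message: str, completion: str) -> str:
    """One JSONL line in the prompt/completion fine-tuning format."""
    return json.dumps({
        "prompt": message + SEPARATOR,
        # Completions conventionally start with a space and end with a stop marker.
        "completion": " " + completion + "\n",
    })

def make_inference_prompt(message: str) -> str:
    """Use exactly the same ending when querying the fine-tuned model."""
    return message + SEPARATOR

print(make_training_example("[[original message]]", "expected output"))
print(make_inference_prompt("[[original message]]"))
```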

Thanks! I actually use \n####\n at the end of the prompt and \n\n at the end of the completion for training… it might be worth adding a stop sequence at the end of the completion, following what you’re saying. I’m getting repetitions within the response as well (looping towards the end), but that’s easier to take care of.
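Something like this is what I have in mind for the stop sequence (a sketch against the legacy openai-python (<1.0) Completion endpoint; the fine-tuned model name is a placeholder):

```python
# Sketch only: pass the completion-ending marker from training as a stop
# sequence at inference time. The model name below is a placeholder.
import openai

response = openai.Completion.create(
    model="curie:ft-personal-2023-01-01-00-00-00",  # placeholder fine-tuned model
    prompt="[[original message]]\n####\n",          # same ending the training prompts use
    max_tokens=200,
    temperature=0.7,
    stop=["\n\n"],  # the marker the training completions end with
)
print(response["choices"][0]["text"])
```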

It’s best not to use whitespace at the end, and the #### is also unclear, so you might want to switch to something more specific or concrete. This is especially important if you end up making a fine-tuned model with multiple purposes.
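For example (names made up, purely illustrative), separators along these lines end on the marker itself rather than on whitespace, and stay distinguishable per task:

```python
# Illustrative only: distinct, whitespace-free prompt endings per task,
# so a multi-purpose fine-tuned model can tell its jobs apart.
SUMMARIZE_SEP = "\n\n==>SUMMARY:"  # ends on the marker, not on whitespace
CLASSIFY_SEP = "\n\n==>LABEL:"

prompt = "[[original message]]" + SUMMARIZE_SEP
print(prompt)
```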
