Fine tuning success stories - new 2023 models, what are your results?

_j · September 22, 2023, 6:44pm

Do you have a held-back validation set for checking quality, where you could do a random shuffle across both files?
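One way to read the shuffle suggestion, as a rough sketch only: pool the examples from the training file and the validation file, shuffle them with a fixed seed, and cut a fresh held-back slice. The function name `reshuffle_split`, the 20% fraction, and the seed are all assumptions for illustration, not something from this thread.

```python
import random

def reshuffle_split(train_lines, valid_lines, valid_fraction=0.2, seed=0):
    """Pool two lists of JSONL lines, shuffle them, and re-split.

    Hypothetical helper: valid_fraction and seed are illustrative defaults.
    Returns (new_train_lines, new_valid_lines).
    """
    # Drop blank lines so an empty trailing line never becomes an "example".
    pooled = [ln for ln in list(train_lines) + list(valid_lines) if ln.strip()]
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    rng.shuffle(pooled)
    n_valid = max(1, int(len(pooled) * valid_fraction))
    return pooled[n_valid:], pooled[:n_valid]
```

If the validation score changes a lot between the original split and a reshuffled one, the two files probably did not come from the same distribution.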

I found something that is kind of “well, duh!”: when an untrained model prints a number after the natural-word separator “sentiment:”, the probabilities of the number tokens get significantly stronger if the prompt already has a space after the colon, so you’re not asking the AI to generate that space. The long tail of possible word tokens that start with a space is eliminated.

So just take the model that was producing spaces and, when you use it, put another unstripped space at the end of your application’s prompt.

Or, for new training and use, choose a separator that ends with newlines.

No stop sequence is needed if you only allow one token.
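A minimal sketch of how that could look in application code, assuming a completions-style request where `model`, `prompt`, `max_tokens`, and `temperature` are the parameter names. The exact separator layout, the helper names `build_prompt` and `build_request`, and the model id in the usage line are placeholders, not anything confirmed in this thread.

```python
SEPARATOR = "\n\nsentiment:"  # assumed layout around the "sentiment:" separator

def build_prompt(text, separator=SEPARATOR, trailing_space=True):
    """Append the separator and, optionally, the space the model would otherwise emit."""
    prompt = text.rstrip() + separator
    if trailing_space:
        # Supply the space ourselves; do not strip it later in the pipeline.
        prompt += " "
    return prompt

def build_request(model, text):
    """Build request parameters for a single-token answer (no stop sequence set)."""
    return {
        "model": model,        # placeholder: your own fine-tuned model id
        "prompt": build_prompt(text),
        "max_tokens": 1,       # only one token allowed, so nothing needs stopping
        "temperature": 0,
    }

request = build_request("my-fine-tuned-model", "The battery died after a week.  ")
```

The dictionary only carries the parameters; passing it to a client library is left out here on purpose.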

sergeliatko · October 9, 2023, 8:05pm

I use <|endoftext|> for generations of varying length, and I dropped stop sequences completely on reformatting tasks where the generation is heavily based on the input.
