Fine tuning success stories - new 2023 models, what are your results?

If it ain’t broke…right?

Just to clarify:

  • separator: The last part of your prompt. It signals to the AI that your trained output format should begin once this particular token sequence is seen. It is the final cue given to the AI.

  • stop sequence: The last part of your completion. It is a particular phrase or token sequence that, when produced, matches one of the API’s “stop” strings and terminates output that would otherwise keep continuing or repeating.

Goofy completion model tricks:

Useful stop sequence fine-tuning

{"prompt": "prefix:<prompt text>", "completion": "<ideal generated text>prefix:"}

You set the stop sequence to the same role prefix your prompt uses. This is the pattern of chatbots, but most wouldn’t think to fine-tune this way.

Why: If you turn off the stop sequence, you get an AI that keeps on simulating these turns. You can see it make a bunch of fine-tuned responses to its own fine-tuned inputs.
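
A minimal sketch of using this pattern at inference time, assuming the openai Python package (v1.x completions endpoint) and a placeholder fine-tuned model name ("ft:davinci-002:org::example" is hypothetical). Drop the stop parameter to watch it keep simulating turns:

```python
# Sketch: using the prompt's own role prefix as the stop sequence.
from openai import OpenAI

client = OpenAI()
prompt = "prefix:Tell me about ripe bananas\n"

# Normal use: stop on the same "prefix:" role marker the prompt uses,
# so generation ends right before the model starts a new simulated turn.
response = client.completions.create(
    model="ft:davinci-002:org::example",  # hypothetical fine-tuned model name
    prompt=prompt,
    max_tokens=200,
    stop=["prefix:"],
)
print(response.choices[0].text)

# Omit `stop` and the model keeps going, writing fine-tuned responses
# to its own fine-tuned inputs.
unstopped = client.completions.create(
    model="ft:davinci-002:org::example",
    prompt=prompt,
    max_tokens=400,
)
print(unstopped.choices[0].text)
```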

Going past the prompt you trained on:

given normal fine-tune training:
{"prompt": "<prompt text> #-->", "completion": "<ideal generated text>##STOP!##"}

You can disobey your own prompt style. This can have some interesting uses for probing performance.
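
If you are assembling the training file by hand, a small sketch like this (plain Python, standard library only) keeps the separator and stop sequence consistent across every line; the "#-->" and "##STOP!##" strings are just the example markers above:

```python
# Sketch: write fine-tune training lines in the prompt/completion JSONL format,
# appending the same separator and stop sequence to every example.
import json

SEPARATOR = " #-->"   # end-of-prompt marker from the example above
STOP = "##STOP!##"    # end-of-completion marker from the example above

examples = [
    ("A ripe banana", "Sure, I'll write a story about a ripe banana..."),
    ("I like frogs", '{"sentiment": "happy", "length": 12}'),
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for prompt_text, ideal_text in examples:
        line = {
            "prompt": prompt_text + SEPARATOR,
            # a leading space on the completion was a common recommendation
            # for legacy completion fine-tunes
            "completion": " " + ideal_text + STOP,
        }
        f.write(json.dumps(line) + "\n")
```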

Overcomplete prompt:
"A ripe banana#—>Sure, I’ll write a story
"I like frogs#—>{“sentiment”: “happy”, "length

You can go deeper into your fine-tune and redirect the output to a particular example, a particular part of the output, or even out-of-domain, then see what is produced.
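
A sketch of probing this way, reusing the client from the earlier sketch; the model name is again a placeholder for whichever fine-tune you are poking at, here one trained with the "#-->" / "##STOP!##" format:

```python
# Sketch: overcomplete prompts push past the trained separator and
# redirect the model into a particular completion style (or out-of-domain).
probes = [
    'A ripe banana#-->Sure, I\'ll write a story',
    'I like frogs#-->{"sentiment": "happy", "length',
]

for probe in probes:
    out = client.completions.create(
        model="ft:davinci-002:org::example",  # hypothetical model name
        prompt=probe,
        max_tokens=100,
        stop=["##STOP!##"],
    )
    print(repr(out.choices[0].text))
```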

Incomplete Prompt:
"A ripe"

When the AI completes your own prompt, you see what the fine-tune is expecting from you, and how long it goes before (and whether) it produces the separator on its own from strong tuning.
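
And the incomplete-prompt probe, under the same assumptions: give only a fragment, set no stop, and see whether the model finishes your prompt and emits the separator by itself:

```python
# Sketch: incomplete prompt with no stop sequence, to see whether the
# fine-tune finishes your prompt and produces the "#-->" separator on its own.
out = client.completions.create(
    model="ft:davinci-002:org::example",  # hypothetical model name
    prompt="A ripe",
    max_tokens=80,
)
print(out.choices[0].text)
```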