Fine-tuning with conversation format: Which messages are used for training?

_j · September 8, 2023, 5:28pm

From the OpenAI cookbook on fine-tune chat model:

During the training process this conversation will be split, with the final entry being the completion that the model will produce, and the remainder of the messages acting as the prompt. Consider this when building your training examples - if your model will act on multi-turn conversations, then please provide representative examples so it doesn’t perform poorly when the conversation starts to expand.

(and again an example where just the prompt gives the desired results)

Topic		Replies	Views
Fine tuning data format for chatting history API chatgpt	2	378	March 20, 2024
How does gpt-3.5-turbo fine-tuning work? API gpt-35-turbo , fine-tuning	10	1913	September 11, 2023
How closely does my training data need to match my prompt sequencing for Fine-tuning to be effective? API fine-tuning , training	7	980	February 6, 2024
Correct format for dataset in chat model fine-tuning API fine-tuning , documentation	4	1952	January 9, 2024
Finetuning for shortening prompts Documentation fine-tuning	10	3840	December 24, 2023

Fine-tuning with conversation format: Which messages are used for training?

Related topics