Fine-tuning with conversation format: Which messages are used for training?

From the OpenAI cookbook on fine-tune chat model:

During the training process this conversation will be split, with the final entry being the completion that the model will produce, and the remainder of the messages acting as the prompt. Consider this when building your training examples - if your model will act on multi-turn conversations, then please provide representative examples so it doesn’t perform poorly when the conversation starts to expand.

(and again an example where just the prompt gives the desired results)

3 Likes