Strategies for Fine-tuning Conversational AI Models Using Multiple Conversation Examples

amjuks · June 7, 2024, 11:57am

For fine-tuning, I have multiple conversation examples of AI and user. How do I pass these examples for fine-tuning? If I pass them together for training my results might not be as expected.

For training, what other way in my jsonl file can I use to differentiate?

{"messages": conversation_1 + conversation_2}

Or are we allowed to have multiple {"messages": []} in our jsonl training file? Can we use json file if it’s allowed?

And after fine-tuning, I have seen system message used again for generating response, why is it? How does it, help in what way?

gdfrza · June 7, 2024, 12:26pm

In my opinion, that may be not accurate, separate user and ai messages in your JSONL file for better fine-tuning, i think multiple conversations and JSONL format are funcional. Maybe use a “character personality” for training based on your specific goals and data availability you may need a good amount of data

jr.2509 · June 7, 2024, 12:47pm

Hi there!

The formatting you have supplied is not in line with OpenAI’s requirements for the structure of the JSONL file for multi-turn conversation fine-tuning.

The correct formatting looks as follows:

{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "What's the capital of France?"}, {"role": "assistant", "content": "Paris", "weight": 0}, {"role": "user", "content": "Can you be more sarcastic?"}, {"role": "assistant", "content": "Paris, as if everyone doesn't know that already.", "weight": 1}]}
{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "Who wrote 'Romeo and Juliet'?"}, {"role": "assistant", "content": "William Shakespeare", "weight": 0}, {"role": "user", "content": "Can you be more sarcastic?"}, {"role": "assistant", "content": "Oh, just some guy named William Shakespeare. Ever heard of him?", "weight": 1}]}
{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "How far is the Moon from Earth?"}, {"role": "assistant", "content": "384,400 kilometers", "weight": 0}, {"role": "user", "content": "Can you be more sarcastic?"}, {"role": "assistant", "content": "Around 384,400 kilometers. Give or take a few, like that really matters.", "weight": 1}]}

Topic		Replies	Views
How to structure fine tuned data API	9	4269	December 23, 2023
Fine tuning model on full chats instead of a prompt response API gpt-35-turbo , fine-tuning , api	4	792	November 26, 2023
Correct format for dataset in chat model fine-tuning API fine-tuning , documentation	4	1654	January 9, 2024
Fine tune on multi-turn conversations API fine-tuning , fine-tuning-problems	0	1050	October 11, 2023
Fine-tuning training data with multiple ideal responses API fine-tuning	1	30	August 22, 2024

Strategies for Fine-tuning Conversational AI Models Using Multiple Conversation Examples

Related Topics