Fine-tuning on conversations

Hello

I have a dataset consisting of dialogues between two people which I would like to use for fine-tuning GPT. Please see below for two example dialogues. The dialogues vary in length and can be longer than the examples.

Is the format of the conversations ok? For fine-tuning, should I just concatenate all conversations, or do I have to use a separator between them (if yes, which separator)?

First Dialogue:

user1:
Hey there. What’s up?

user2:
Not much, just hanging out. What about you?

user1:
Just thinking about what I’m going to do this weekend. You?

user2:
Probably just relaxing. What do you have planned?

user1:
I’m thinking about going to the beach. It’s supposed to be nice this weekend.

user2:
That sounds like a great plan! Have you been to the beach recently?

user1:
Not in a while. It would be nice to get out and enjoy the sun.

user2:
Definitely! I’m sure it’ll be a great time. Do you have any other ideas for the weekend?

Second Dialogue:

user1:
Good morning. What is your profession?

user2:
Good morning. I’m an accountant. What about you?

user1:
I’m a software engineer. How long have you been an accountant?

user2:
I’ve been an accountant for about five years now. What about you? How long have you been a software engineer?

user1:
I’ve been a software engineer for three years. What do you like most about accounting?

user2:
I like how challenging it can be. There’s always something to learn or something new to figure out. What do you like most about software engineering?

user1:
I like how creative it can be. I get to come up with new ideas and new ways of solving problems. It’s a great feeling when you can come up with something that works.

For a generic back and forth, try creating a jsonl file like this:

{"prompt": "Hey there. What's up?\n\n###\n\n", "completion": " Not much, just hanging out. What about you?"}
{"prompt": "Not much, just hanging out. What about you?\n\n###\n\n", "completion": " Just thinking about what I'm going to do this weekend. You?"}

Note how they have 50 percent overlap.
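To generate pairs like that from a whole dialogue, one option is a short script. This is only a sketch: it assumes each dialogue is stored as a Python list of utterance strings in order, and uses the `"\n\n###\n\n"` prompt separator and leading-space completion shown above.

```python
import json

# Assumed input format: one dialogue as an ordered list of utterances,
# alternating between the two speakers.
dialogue = [
    "Hey there. What's up?",
    "Not much, just hanging out. What about you?",
    "Just thinking about what I'm going to do this weekend. You?",
    "Probably just relaxing. What do you have planned?",
]

# Each utterance becomes a prompt; the following utterance becomes its
# completion. Consecutive records share an utterance, giving the
# 50 percent overlap described above.
with open("train.jsonl", "w") as f:
    for prompt, completion in zip(dialogue, dialogue[1:]):
        record = {
            "prompt": prompt + "\n\n###\n\n",   # separator marks end of prompt
            "completion": " " + completion,      # leading space on completion
        }
        f.write(json.dumps(record) + "\n")
```

Running this over the four-utterance example yields three JSONL records, one per adjacent pair.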

I’d be curious to see what you come up with!


Thank you very much. Could it be a problem that your version only takes one previous message into account, when usually many more previous messages are important for the context of a conversation?

You are correct: only the previous input is taken into context to produce the next output. But your example seemed to have a gentle back-and-forth feel to it, without a deeper prior context.
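If more context matters, one common workaround is a sliding window: include the last few utterances in each prompt instead of just one. A minimal sketch, assuming the same list-of-utterances input and a hypothetical window size of three:

```python
import json

dialogue = [
    "Good morning. What is your profession?",
    "Good morning. I'm an accountant. What about you?",
    "I'm a software engineer. How long have you been an accountant?",
    "I've been an accountant for about five years now.",
]

CONTEXT_TURNS = 3  # assumed window size; tune for your data and token budget

# Each record's prompt is the last CONTEXT_TURNS utterances joined by
# newlines; the completion is the utterance that follows them.
with open("train_context.jsonl", "w") as f:
    for i in range(1, len(dialogue)):
        context = dialogue[max(0, i - CONTEXT_TURNS):i]
        record = {
            "prompt": "\n".join(context) + "\n\n###\n\n",
            "completion": " " + dialogue[i],
        }
        f.write(json.dumps(record) + "\n")
```

The trade-off is longer prompts (more tokens per example), but the model sees the same multi-turn context at fine-tuning time that you would give it at inference time.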

To have a deeper context ‘virtual you’, I think you need at least two fine tuned models. And the more I think about it, I think you also need several classifiers running in the background, as well as several non-AI based RegEx pattern cross-correlation engines.

Here are at least my early thoughts on creating a deeper 'virtual you' without the extra stuff I just mentioned above:

Did you figure anything out about this @Helveticus ? @curt.kennedy ? I’m also looking for fine-tuning gpt3 on the conversational data where the data is highly contextual.

@micamecava I found a possible personality-extracting solution over here: Extracting Personalities from past Conversations? - #6 by curt.kennedy