Fine-tuning a model so it adopts style and tone of voice

olivierB · September 20, 2024, 3:06pm

Hey there,

Disclaimer: I’m not an engineer!

I’m trying to fine-tune GPT 4.o with a 400+ row dataset of my social media posts. The idea is to train it on my ton of voice for future posts.

I read in the documentation that for each row I would need to add content for the system, which is ok because it’s always the same context for all posts.

However, I also need to add content for the user for each post, which in my understanding means writing what would have been a prompt to output the corresponding post for each post, which is not very convenient/feasible.

Any idea on how to solve this?

thanks

jr.2509 · September 20, 2024, 3:13pm

Why is it not feasible? Because of the associated workload?

One way to approach it would be to reverse engineer the input, i.e. the content for the user message. That is, you could create a few examples of what your prompt would normally look like. You then use few shot prompting to create similar prompts/user messages for your all your other social media posts that you intend to include in your fine-tuning data set.

For the latter you can ask the model to write you a Python script so the process is fully automated if you don’t have coding skills yourself or don’t have someone who can assist you with it.

olivierB · September 20, 2024, 3:19pm

Yes I’v thought of this : asking GPT to write the prompt based on the output, that’s what you mean correct?

As I’m not an engineer, I would do that with no code tools and API calls. However, it seems quite costly in terms of API calls, especially with a large database, right?

So I’m wondering if that input is mandatory so the fine-tuning works well. I have read in another post that leaving this field blank is also a good option. But again I’m not an expert, so wanted to understand the pros and cons of this.

jr.2509 · September 20, 2024, 3:22pm

Yes.

For your case, you may not necessarily need that many training examples. You can fine-tune for language and get decent results back with as little as 50-100 good examples. I would use that as a starting point and then see how the model performs.

I have not tried this technique myself so can’t directly comment on it. What type of instructions / context do you currently have in your system message? Somewhere you need to provide instructions on what the blog post is about etc. Where is that currently included in your content?

olivierB · September 20, 2024, 3:32pm

For the system, in each database row I have “You are a world-class Linkedin content writer and your role is to write Linkedin posts for me. All your posts need to be punchy and straight-to-the-point.”

I’m having trouble understanding what should go in the user content and the system content to be honest. For example the sentence “all your posts…”

jr.2509 · September 20, 2024, 3:34pm

Well, assuming that every post should address a different topic, you need to include somewhere what that topic should be and, ideally, provide some contextual information that the model should use in writing the post.

Normally, you would include this information in the user message.

Your system message is basically fine as is (you can exclude the part “for me”). However, you definitely need to include the info on what the model should write the post about

olivierB · September 20, 2024, 3:38pm

Ok got you.

“However, you definitely need to include the info on what the model should write the post about ”

In the user message for each database row, correct?

jr.2509 · September 20, 2024, 3:40pm

Not sure I understand the database part correctly. You will need to create a JSONL file with your examples consistent with the format shown here.

In the user message you would include the info on the post topic and, if applicable, additional contextual information.

olivierB · September 20, 2024, 3:41pm

Yes ok perfect. By database in meant JSONL file, my bad.

Great will try all this, thank you so much for your help!

Topic		Replies	Views
I need more examples of fine-tunning AI. I added around 1500 promps but API gpt-4	13	323	March 29, 2025
Fine tuning for writing style - lessons and questions API fine-tuning	5	3089	January 17, 2024
A tool to rewrite and format a draft with company tone of voice API fine-tuning , api	5	147	October 8, 2024
Fine-tuning for more natural responses API fine-tuning	4	472	January 13, 2025
Training gpt-3.5 to autocomplete for a niche domain and a specific writing style Community chatgpt	13	1880	July 25, 2024

Fine-tuning a model so it adopts style and tone of voice

Related topics