Hello,
In the context of customer support, I am building a tool that takes as input (1) the conversation history and (2) a draft of the new response, then rewrites the draft into a complete email formatted to match the company's tone of voice.
So far I have tried prompt engineering, and I got decent but not production-ready results.
Then I prepared a fine-tuning dataset of 50 conversation+draft examples, each paired with the corresponding final email written the way I want it to come out in the model response. With this dataset, I created a fine-tuned model based on gpt-4-mini.
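For reference, a chat fine-tuning dataset is a JSONL file with one training example per line, each mirroring the exact inference-time input plus the ideal output. A minimal sketch (the system prompt, conversation, and email text below are placeholders, not the actual data):

```python
import json

# One training example: the same system prompt and conversation+draft input
# used at inference time, plus the ideal final email as the assistant turn.
example = {
    "messages": [
        {"role": "system", "content": "Rewrite the draft into a complete email in the company tone."},  # placeholder prompt
        {"role": "user", "content": "Conversation history:\n...\n\nDraft:\n..."},                        # placeholder input
        {"role": "assistant", "content": "Dear customer,\n..."},                                         # ideal final email
    ]
}

# Each line of the .jsonl training file is one such JSON object.
line = json.dumps(example)
print(line)
```

The key point is that the system and user messages in every training line should match what the production requests will send, so the model learns the mapping from draft to finished email rather than memorizing unrelated context.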
I was ready to be amazed by my super cool fine-tuned model, but it performs worse than the prompt-engineering version. It adds random words and weird stuff to the response, and it doesn't even get the tone of voice right all the time.
Note that the prompt used before the inputs is exactly the same in both versions.
At this point I would like to ask if you have any advice or suggestions on how best to deal with this use case.
Thanks in advance!
Hi there and welcome to the Community!
In general, this is a great use case for fine-tuning and should work in principle.
What temperature setting are you applying when using the fine-tuned model? Issues often arise when users apply too high a temperature to a fine-tune. I am wondering if this could be the root cause in your case?
You may also consider further adjusting your prompt to target specific undesired behaviours that are common across your outputs.
I am using a temperature of 1.2; do you think it's too high?
I am also wondering if I fine-tuned the correct way: given that the input of every request will be conversation+draft, I reproduced the exact same inputs in the fine-tuning dataset and added a perfectly formatted response as the output.
Yeah, the temperature is likely the cause of the issue. Try a value of no more than 0.5 instead.
The data set you used sounds fine.
Let us know how it goes.
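A minimal sketch of lowering the temperature on the completion call, assuming the OpenAI Python client (the fine-tuned model ID and message content below are placeholders):

```python
# Hedged sketch: assemble the request kwargs with a low temperature so the
# fine-tune's outputs stay close to the training distribution.
def build_request(messages, model="ft:gpt-4-mini:your-org::placeholder", temperature=0.2):
    """Build keyword arguments for client.chat.completions.create
    with the temperature pinned low (<= 0.5)."""
    return {"model": model, "messages": messages, "temperature": temperature}

kwargs = build_request(
    [{"role": "user", "content": "Conversation history + draft go here"}]
)
# With the real client:
#   client = openai.OpenAI()
#   resp = client.chat.completions.create(**kwargs)
```

The same system prompt used in the training data should be included in the messages so the request matches the fine-tuning examples.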
OK, I will try with 0.2 and let you know how it goes.
The prompt used both in the fine-tuning dataset and in the regular requests is pretty long: 742 tokens. Do you think it's too long?
I don’t think it’s too long, or at least I don’t think it’s what is causing the issue.