Using fine-tuning for operational report generation

joern · April 15, 2023, 1:44pm

Hey there,
For the past few days I’ve been trying to build an application that generates operational reports from a set of specific parameters.

To that end, I scraped about 400 press reports from a publicly accessible page and made completion requests with text-davinci-003 to extract the specific parameters from each report as a JSON object. So far so good - in my opinion this worked better than expected.

But then I removed some really dirty reports from the training data and kept the remaining press reports.
For every remaining press report I generated a prompt similar to this:

Parameters:
{PARAMETER_OBJECT}

Press Report:

I combined the prompts and the completions (the scraped press reports) into a JSONL file and created a fine-tune (davinci, 2 epochs) from this file.
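
For reference, here is a minimal sketch of how such a file can be assembled. It is not my exact script: the function names, the example parameters, and the ` END` stop marker are placeholders I made up for illustration. The leading space and the fixed stop marker on each completion follow what I understand the legacy prompt/completion fine-tuning guide to recommend, so treat them as assumptions as well.

```python
import json

# Prompt layout from the template above; it ends with the fixed "Press Report:" separator.
PROMPT_TEMPLATE = "Parameters:\n{parameters}\n\nPress Report:\n\n"

# Hypothetical stop marker appended to every completion (an assumption, not from my data).
STOP_SEQUENCE = " END"


def build_record(parameters: dict, report: str) -> dict:
    """Turn one (parameters, press report) pair into a prompt/completion record."""
    prompt = PROMPT_TEMPLATE.format(
        parameters=json.dumps(parameters, ensure_ascii=False)
    )
    # Completion starts with a space and ends with the stop marker.
    completion = " " + report.strip() + STOP_SEQUENCE
    return {"prompt": prompt, "completion": completion}


def to_jsonl(pairs) -> str:
    """Serialize (parameters, report) pairs as JSON Lines, one record per line."""
    lines = [
        json.dumps(build_record(parameters, report), ensure_ascii=False)
        for parameters, report in pairs
    ]
    return "\n".join(lines) + "\n"
```

With a stop marker like this in the training data, the same string would be passed as the `stop` parameter at inference time so that generation ends where the report ends.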

When I then tested the model, the output was total crap. It often repeats sentences endlessly (this stopped after I increased the frequency penalty), and worst of all it hallucinates a lot, even at temperature 0. For example, it writes instructions for the use of the press report’s photos, among other things. I think this happens because the training data contains some sections that can’t be derived from the input parameters.
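
One way to check this suspicion is to look for lines that recur across many reports, since boilerplate such as photo usage notes tends to be repeated verbatim. The following is only a rough sketch under that assumption: the function names and the `min_share` threshold are made up for illustration, and it would miss boilerplate that is reworded from report to report.

```python
from collections import Counter


def find_recurring_lines(reports, min_share=0.2):
    """Return lines that occur in at least `min_share` of all reports."""
    counts = Counter()
    for report in reports:
        # Count each distinct non-empty line once per report.
        unique = {line.strip() for line in report.splitlines() if line.strip()}
        counts.update(unique)
    threshold = min_share * len(reports)
    return {line for line, n in counts.items() if n >= threshold}


def strip_recurring_lines(report, recurring):
    """Remove every line of `report` that is in the `recurring` set."""
    kept = [line for line in report.splitlines() if line.strip() not in recurring]
    return "\n".join(kept).strip()
```

Running `find_recurring_lines` over all scraped reports and then applying `strip_recurring_lines` to each completion would remove the repeated sections before the JSONL file is built.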

So my final question: Do you think fine-tuning is the right (or best) option here, or should I use plain completion instead? And if fine-tuning is the best option, how should I improve the training?
