Davinci-002 fine-tuning completion does not work

Hi.
I have not used ChatGPT since the legacy models were deprecated. However, as far as I know, the OpenAI API still supports completions for davinci-002 and babbage-002. So I fine-tuned them on my old dataset, where the prompt is an article and the completion is the place where it takes place. You can see an example below:

{"prompt": "In Prague, a scooter driver was caught who drove under a \"brick\" in the city center twice. ->", "completion": "{\"result\":[\"Czech Republic, Prague, city center\"]}"}
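
For reference, a minimal sketch of how such a line can be produced (the file name and variable names are illustrative; the leading space on the completion follows the old guidance for these completion models, if I recall it correctly):

```python
import json

# One illustrative pair from the dataset: (article text, completion JSON string).
examples = [
    (
        'In Prague, a scooter driver was caught who drove under a "brick" in the city center twice.',
        '{"result":["Czech Republic, Prague, city center"]}',
    ),
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for article, location_json in examples:
        record = {
            "prompt": article.strip() + " ->",   # same fixed separator on every prompt
            "completion": " " + location_json,   # completion starts with a space (old guidance, if I recall)
        }
        # json.dumps escapes the inner quotation marks, so each line stays valid JSONL
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```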

But when I run completions with this fine-tuned model, the generated text does not differ much from the non-fine-tuned model's output.

What am I doing wrong, and how can I fix it?

I had a similar issue a few years ago. The problem was trailing or leading whitespace in the training dataset. I noticed that when I was converting the data to JSONL, I had made some mistakes.
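
Something like this quick check (the file name is a placeholder) will flag stray whitespace or a missing separator:

```python
import json

# Audit a completion-style training file for the issues mentioned above.
with open("train.jsonl", encoding="utf-8") as f:
    for lineno, line in enumerate(f, start=1):
        record = json.loads(line)
        prompt, completion = record["prompt"], record["completion"]
        if prompt != prompt.rstrip():
            print(f"line {lineno}: prompt has trailing whitespace")
        if not prompt.rstrip().endswith("->"):
            print(f"line {lineno}: prompt is missing the -> separator")
        if completion != completion.rstrip():
            print(f"line {lineno}: completion has trailing whitespace")
```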

Second, try reducing the temperature to something very low, such as 0, and see whether you still get the same nonsensical outputs. I use a temperature of around 0.60 for those models.
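
For example, with the current openai Python client a debug call could look roughly like this (the fine-tuned model id is a placeholder):

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# "ft:davinci-002:my-org::abc123" stands in for your actual fine-tuned model id.
response = client.completions.create(
    model="ft:davinci-002:my-org::abc123",
    prompt='In Prague, a scooter driver was caught who drove under a "brick" in the city center twice. ->',
    temperature=0,   # near-deterministic output while debugging the fine-tune
    max_tokens=60,
)
print(response.choices[0].text)
```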

Thanks for your help.

  1. Unfortunately, I have run into this whitespace problem on the same dataset before, but I fixed it and it worked with the legacy models. So my problem may not be related to the dataset itself, but I will review it anyway.
  2. Lowering the temperature does not change anything :frowning:

You can include the prompting that leads to the completion output anyway, and just use the fine-tune as a quality improvement.

prompt:


Here is an AI that reads the input text, and then as output, it performs entity extraction, extracting any unique geographic locations mentioned in the text as a descriptive JSON array of strings result, written as natural language strings (or an empty array for no results). The JSON closes with ]}

// input text:
In Prague, a scooter driver was caught who drove under a “brick” in the city center twice.

// output JSON:
{"result": [


The prompt (actual prompt engineering) puts the AI in the mindset of “completing” a task; we even start writing the output for it as a prompt separator, and use that pattern to stop it when it would otherwise keep looping. Below I also show using the closing of the JSON as a stop sequence to save you extra tokens.
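
A rough sketch of that call with the openai Python client, using the closing of the JSON as the stop sequence (model name and template wording are only illustrative):

```python
from openai import OpenAI

client = OpenAI()

# Instruction mirrors the prompt above; swap "davinci-002" for your fine-tuned model id.
PROMPT_TEMPLATE = (
    "Here is an AI that reads the input text, and then as output, it performs "
    "entity extraction, extracting any unique geographic locations mentioned in "
    "the text as a descriptive JSON array of strings result, written as natural "
    "language strings (or an empty array for no results). The JSON closes with ]}}\n"
    "\n"
    "// input text:\n"
    "{text}\n"
    "\n"
    "// output JSON:\n"
    '{{"result": ['
)

text = 'In Prague, a scooter driver was caught who drove under a "brick" in the city center twice.'

response = client.completions.create(
    model="davinci-002",
    prompt=PROMPT_TEMPLATE.format(text=text),
    temperature=0.2,
    max_tokens=60,
    stop=["]}"],   # stop as soon as the JSON closes, saving output tokens
)

# Reattach the primed opening and the stop sequence to get the full JSON back.
print('{"result": [' + response.choices[0].text + "]}")
```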

You can see the performance still needs your fine-tune to be as descriptive as you imagine: identifying a single location rather than filling a list, not emitting latitudes, and emitting its “stop” instead of continuing to write.

Thanks for your attention!
I don’t think this is the best solution, because we would always have to pass the whole instruction to GPT. It looks like we would pay more for the instruction than for the sentence analysis itself. Moreover, the old models worked just the way I wanted on the same dataset.