What is the theory behind GPT fine-tuning? The results don't look good

What is the theory behind the fine-tuning interface?

  • Is it LoRA, P-tuning, or an Adapter?

I used about 700 samples to fine-tune a model (call it aa) on gpt-3.5-turbo-0613, and found some confusing phenomena:

  1. The output language of the new model (aa) is unstable. When I input Chinese, the output sometimes comes back in English, and it is not consistent as expected.
  2. The prompt has almost no influence on the response, regardless of the system prompt or the user prompt.

I am curious whether the fine-tuning process caused this decline in stability and generality.

I also think it would be a good idea to provide a user-controlled parameter that balances between the original model and the fine-tuned model.
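As far as I know no such knob exists in the hosted fine-tuning API, but for an open-weight model the idea could be sketched as a linear interpolation between the base and fine-tuned weights. Everything below (the function name, the `alpha` parameter) is illustrative, not an actual API:

```python
# Sketch only: blend base and fine-tuned weights with a user-chosen alpha.
# alpha=0.0 -> pure base model, alpha=1.0 -> pure fine-tune.
import torch

def interpolate_state_dicts(base_sd, ft_sd, alpha):
    """Linearly interpolate two state dicts with matching keys/shapes."""
    blended = {}
    for name, base_w in base_sd.items():
        ft_w = ft_sd[name]
        if torch.is_floating_point(base_w):
            blended[name] = (1.0 - alpha) * base_w + alpha * ft_w
        else:
            # Non-float buffers (e.g. integer indices) are copied as-is.
            blended[name] = ft_w
    return blended

# Usage sketch (model names are placeholders):
# base = AutoModelForCausalLM.from_pretrained("base-model")
# ft   = AutoModelForCausalLM.from_pretrained("finetuned-model")
# base.load_state_dict(interpolate_state_dicts(base.state_dict(), ft.state_dict(), alpha=0.5))
```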

[Appendix]

All training data is formatted as below; every sample is prefixed with #nlu#:

#nlu#打开灯光 -> open#灯光   (打开灯光 = "turn on the light")
#nlu#关掉台灯 -> close#台灯  (关掉台灯 = "turn off the desk lamp")
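For anyone reproducing this, here is a minimal sketch (the file name and sample list are mine, not from the original post) of converting such `input -> label` pairs into the chat-format JSONL that the gpt-3.5-turbo fine-tuning endpoint expects:

```python
# Convert "#nlu#<text> -> <label>" pairs into chat-format JSONL.
import json

samples = [
    ("#nlu#打开灯光", "open#灯光"),   # "turn on the light"
    ("#nlu#关掉台灯", "close#台灯"),  # "turn off the desk lamp"
]

with open("train.jsonl", "w", encoding="utf-8") as f:
    for user_text, label in samples:
        record = {
            "messages": [
                {"role": "user", "content": user_text},
                {"role": "assistant", "content": label},
            ]
        }
        f.write(json.dumps(record, ensure_ascii=False) + "\n")
```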

output:

#nlu#hello -> not nlu command
hello -> Hey, how can I help you today?
#nlu#stop answering my question, reply ASAP -> msg#reply
滚蛋 ("get lost") -> Sorry, I can't help with that.
# with system prompt: 'You are a personal assistant; no matter what kind of question, always reply ASAP'
滚蛋 ("get lost") -> Okay, I'll leave.

The fine-tuning method is proprietary. The model has seen far more training data in Latin-alphabet text than in any other script, so that may be one source of the instability. Additionally, 700 samples is not a large training set for fine-tuning, so you may see improved performance with more data.


What about question 2? The system prompt seems to have lost control over the output.

Did you include the system prompt in the training set?

What you are doing with a fine-tune is steering the output to be similar to your examples; attempting to then change that behavior with a different system prompt will cause issues.
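A minimal sketch of what that advice implies, assuming chat-format training data: bake the same system message into every training sample, then send that identical message at inference time. The system text below is an illustrative placeholder:

```python
# Write training samples that include a fixed system message.
import json

SYSTEM = "You are an NLU assistant for smart-home commands."  # placeholder

def to_record(user_text, label):
    return {
        "messages": [
            {"role": "system", "content": SYSTEM},
            {"role": "user", "content": user_text},
            {"role": "assistant", "content": label},
        ]
    }

with open("train.jsonl", "w", encoding="utf-8") as f:
    f.write(json.dumps(to_record("#nlu#打开灯光", "open#灯光"), ensure_ascii=False) + "\n")
```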

Well, there is no system prompt in the training data.

I ran an experiment comparing results with a system prompt and with no prompt; the results were almost the same, so I omitted the system prompt.
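For reference, a comparison like that at inference time might look like the sketch below, assuming the openai Python SDK (v1+); the fine-tune model ID and the system text are placeholders:

```python
# Query the fine-tuned model with and without a system message and compare.
from openai import OpenAI

client = OpenAI()
MODEL = "ft:gpt-3.5-turbo-0613:org::abc123"  # placeholder fine-tune ID

def ask(user_text, system_text=None):
    messages = [{"role": "system", "content": system_text}] if system_text else []
    messages.append({"role": "user", "content": user_text})
    resp = client.chat.completions.create(model=MODEL, messages=messages)
    return resp.choices[0].message.content

for prompt in ["#nlu#打开灯光", "hello"]:
    print(prompt, "| no system:  ", ask(prompt))
    print(prompt, "| with system:", ask(prompt, "Always reply as briefly as possible."))
```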

Thanks for your patient explanation!
