Fine-tuning problem, multiple completion

willycha0119 · March 6, 2023, 2:15am

I am preparing jsonl

I want to train the model to use the prefix of the text to predict the sentence I want to type

For example

“The weather is nice today”

So when I type “t”, “w”, “i”, “n”, “t”

I hope he can answer me “The weather is nice today”

But it could also be “That’s why I need teamwork”

So I prepared the dataset as below

{“prompt”:“t w i n t”,“completion”:“The weather is nice today”}
{“prompt”:“t w i n t”,“completion”:“That’s why i need teamwork”}

Is this preparation in the right direction?

Or is there something I need to modify?

ruby_coder · March 6, 2023, 2:22am

No. Your data is JSONL compliant but it does not meet the OpenAP data formatting requirements for fine-tuning.

Reference:

Topic		Replies	Views
Trying To Fine-Tune To Overcome Prompt Size Limit API	4	1446	December 17, 2023
Fine Tuning text completion model with Davinci-002 using blank prompts API fine-tuning , fine-tuning-problems , fine-tune	2	524	February 29, 2024
Fine tuning DaVinci , Need help finding prompt ideas API	2	573	April 25, 2023
Fine tune model problem API	6	778	January 10, 2023
Got awful results after fine-tuning API	11	3209	December 1, 2022