Fine-tuning: completions are cut off

Hello,

I just tried to fine-tune davinci on a small dataset (50 lines). In my dataset, I put a separator at the end of each prompt (\n\n###\n\n) and a different stop sequence at the end of each completion (###), as the guide says.
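For reference, here is a minimal sketch of one training line in the JSONL format the fine-tuning guide describes, using the separator and stop from the post. The question/answer text is purely illustrative:

```python
import json

# Separator/stop strings as described in the post:
SEPARATOR = "\n\n###\n\n"  # marks the end of every prompt
STOP = " ###"              # marks the end of every completion

# One illustrative example (the content itself is made up):
example = {
    "prompt": "What is the capital of France?" + SEPARATOR,
    "completion": " Paris." + STOP,
}

line = json.dumps(example)  # one line of the .jsonl training file
print(line)
```

Each such line becomes one record in the training file passed to the fine-tuning job.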

The model is trained and is working.
Nevertheless, I have a problem. When I send a prompt (with the OpenAI CLI), the completion I receive consists of the prompt plus the start of a sentence (the real completion I want), which is cut off.

Why does it repeat the prompt? Do I need to pass a parameter on the command line to make it use more tokens?
Do I need to append the separator \n\n###\n\n to the prompt I send?
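Both questions point at inference-time settings. A sketch of the request parameters, following the legacy openai-python Completion API shape (the model name is a placeholder, and the values are illustrative, not confirmed by the thread):

```python
# Separator used in training, appended again at inference time so the
# model sees the same "end of prompt" signal it was trained on.
SEPARATOR = "\n\n###\n\n"

def build_request(user_prompt: str) -> dict:
    """Assemble keyword arguments for a Completion call (no network here)."""
    return {
        "model": "davinci:ft-your-org",   # placeholder fine-tuned model name
        "prompt": user_prompt + SEPARATOR,
        # Raise max_tokens so the completion is not truncated mid-sentence
        # (the default is small).
        "max_tokens": 256,
        # Stop generating once the completion-side stop sequence appears.
        "stop": ["###"],
    }

params = build_request("What is the capital of France?")
print(params["prompt"])
```

With the CLI, the equivalent would be passing the prompt with the separator appended and raising the max-tokens option on the completion command.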

Thanks all for your help!


I prefer to use natural language to demarcate the prompt from the completion. That works well and consistently for me. Symbolic tokens like ### are semantically meaningless, and if you use them multiple times, the model can get confused.


Can you share a sample line from your dataset and a sample prompt? (DM works if preferred!)


I changed my separator and stop to [s] and [e] and it's working better. The separator and stop suggested in the documentation seem wrong to me because they both contain ###!
Thanks to luke for his help!
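The overlap can be checked directly: the documented stop is a substring of the documented separator, while the new pair is disjoint (a quick sanity check, not from the thread):

```python
# Why reusing "###" in both places is risky: the stop sequence is a
# substring of the prompt separator, so any echoed or generated separator
# text immediately triggers the stop and truncates the output.
old_separator, old_stop = "\n\n###\n\n", "###"
new_separator, new_stop = "[s]", "[e]"

print(old_stop in old_separator)  # → True  (old stop collides with separator)
print(new_stop in new_separator)  # → False (new pair is disjoint)
```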
