I’m working on a training data set and trying to ensure it’s formatted the right way, but something doesn’t add up.
In the docs, the example format shows the prompt value ending with `Agent:`:
{"prompt":"Summary: <summary of the interaction so far>\n\nSpecific information:<for example order details in natural language>\n\n###\n\nCustomer: <message1>\nAgent: <response1>\nCustomer: <message2>\nAgent:", "completion":" <response2>\n"}
But that is not a unique suffix separator, so I get this warning from the Python CLI:
- All prompts end with suffix `\n\nAgent:`
WARNING: Some of your prompts contain the suffix `\nAgent:` more than once. We strongly suggest that you review your prompts and add a unique suffix
But that is the structure the docs recommend for a chatbot, isn’t it?
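One way to deal with the warning before uploading is to audit the JSONL yourself and make the final `Agent:` cue unique. The sketch below is my own approach, not an official one: the `###` marker I splice in is an arbitrary choice, and any token that never appears in the transcripts would work just as well.

```python
import json

# "\n\n###\nAgent:" is my own pick for a unique suffix -- the "###" marker
# appears nowhere else in the transcript, so the full suffix occurs only once.
CUE = "\nAgent:"
UNIQUE_SUFFIX = "\n\n###\nAgent:"

def fix_prompt(prompt: str) -> str:
    """Replace the trailing 'Agent:' cue with a unique suffix separator,
    but only when the cue also appears earlier in the prompt."""
    if prompt.endswith(CUE) and prompt.count(CUE) > 1:
        prompt = prompt[: -len(CUE)] + UNIQUE_SUFFIX
    return prompt

def fix_jsonl(lines):
    """Rewrite each JSONL training record so its prompt ends uniquely."""
    out = []
    for raw in lines:
        record = json.loads(raw)
        record["prompt"] = fix_prompt(record["prompt"])
        out.append(json.dumps(record))
    return out

example = json.dumps({
    "prompt": "Customer: hi\nAgent: hello\nCustomer: where is my order?\nAgent:",
    "completion": " It ships tomorrow.\n",
})
fixed = json.loads(fix_jsonl([example])[0])
print(fixed["prompt"].count(UNIQUE_SUFFIX))
```

If you change the suffix in the training data, remember to send the same suffix at the end of every prompt at inference time, or the fine-tuned model will not see the cue it was trained on.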
You should not use fine-tuning for a chatbot. Use gpt-3.5-turbo instead: the models you can fine-tune will not work well here, since they have no instruction-following or conversational training data.
Yes, you’re right. Currently we can only fine-tune the base models (ada, babbage, curie, or davinci), not the newer ones like text-*, GPT-3.5, or GPT-4.
If you are working on a chatbot or any Q&A conversation, you can combine embeddings with a text or chat completion model (choose one). A nice, simple way to begin is by reading and experimenting with the tutorials.
Then, once you’re used to how these combined models work, you can read through the discussions to get an idea of how embeddings play a role alongside the chat completion model.
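The core of the embeddings + completion combination can be sketched without any API calls. In the toy example below the vectors are dummy stand-ins for real embeddings (in practice you would get them from an embedding model such as text-embedding-ada-002 for both your documents and the user’s question); the retrieval step picks the most similar document and splices it into the prompt that the completion model would receive.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Toy stand-ins for real embedding vectors of two knowledge-base snippets.
docs = {
    "Orders ship within 2 business days.": [0.9, 0.1, 0.0],
    "Refunds take 5-7 days to process.":   [0.1, 0.9, 0.1],
}
# Pretend embedding of the question "When will my order ship?".
question_vec = [0.8, 0.2, 0.1]

# Retrieve the most similar document, then build the context-stuffed prompt
# that would be sent to the chat completion model.
best_doc = max(docs, key=lambda d: cosine(docs[d], question_vec))
prompt = (
    "Answer using only the context below.\n\n"
    f"Context: {best_doc}\n\n"
    "Question: When will my order ship?"
)
print(best_doc)
```

The point of the pattern is that the completion model never needs to be fine-tuned on your data: the embedding lookup supplies the relevant facts at request time, and the model only has to answer from the context you hand it.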