GPT-3.5-Turbo - Unable to prompt engineer Fine-tuned model

girish17019 · December 9, 2023, 8:53am

I have fine-tuned gpt-3.5-turbo on 5000 conversations from a dataset with a collection of restaurant reservation conversations. I have used the default settings for fine tuning.

I now want to expand the capability of the model to ask the caller’s email id before ending the conversation ( not available in the dataset conversations). I have tried changing the prompts but no matter what prompt I use the model only responds the way of the conversation flow in the dataset. How do I address this issue? Does fine tuning cause the model to be inflexible?

_j · December 9, 2023, 9:01am

The term is overfitting.

Yes, this can happen. Especially with the default epochs parameters, which OpenAI seeming set high enough to allow small fine-tune training files to overcome the massive chat tuning that gpt-3.5-turbo comes with.

You might have included a system prompt that you also use in practice. One way you could break away from the fine-tune in select instances is to fill the AI context with a whole new system prompt that acts as a different identity.

The actual behavior you want, the AI randomly interjecting “by the way, what’s your email” is its own problem, and one that is not typical of a chatbot. You could try your techniques for that on plain gpt-3.5-turbo and see if that isn’t its own basket of kittens.

Topic		Replies	Views
Prompt Usage for Fine-Tuned Models Community gpt-35-turbo , fine-tuning	1	2240	January 4, 2024
Fine tuned model's response is not lengthy/detailed API fine-tuning , fine-tune	4	504	February 23, 2024
Avoid overfitting during the fine-tuning of gpt-3.5 turbo API gpt-35-turbo , fine-tuning , fine-tuning-problems	4	3000	December 21, 2023
Overfitting when giving samples in prompts Prompting	10	1270	December 20, 2023
Fine-tuning only available for 'base models'? API	6	1522	December 23, 2023

GPT-3.5-Turbo - Unable to prompt engineer Fine-tuned model

Related topics