The OpenAI documentation about fine-tuning states: ‘Fine-tuning is currently only available for the following base models: davinci, curie, babbage, and ada. These are the original models that do not have any instruction following training (like text-davinci-003 does for example).’
Can anyone help me understand what exactly this means? Specifically, my questions are:
Does the last sentence in the quote above mean that you cannot ‘prompt’ the fine-tuned model the way you do with gpt-3.5-turbo?
Following on from that, does it mean the fine-tuned model can only do the task you fine-tuned it for? I.e., the same task as whatever ‘prompt/completion’ pairs you used for fine-tuning?
The base models seem quite outdated, but I may be wrong… if that is indeed the case, what would be a use case for fine-tuning instead of simply prompting the more capable models like gpt-3.5-turbo or gpt-4?
Thank you. I read about it, and I think my use case of email generation from prompts/parameters isn’t a good fit for embedding-based approaches. But do correct me if I am wrong.
What I need to do is describe certain ‘characteristics’ of an email (e.g., emotion, sentiment, story focus, purpose, etc.) and ask GPT to generate an email. We have a lot of data that could serve as fine-tuning training data: imagine one example being a given email with multiple labels (over 20) assigned to it. It seems to me that prompting may be the best way to start, but fine-tuning may be useful for teaching the model what each characteristic means…
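To make the idea concrete, here is a minimal sketch of how that labeled data might be reshaped into the JSONL prompt/completion format the legacy fine-tuning endpoint expects. The field names for the characteristics, the `###` separator, and the ` END` stop token are illustrative conventions, not anything prescribed by the docs:

```python
import json

# Hypothetical training examples: the prompt encodes the email's
# characteristic labels; the completion is the email text itself.
examples = [
    {
        "prompt": "emotion: warm; sentiment: positive; purpose: follow-up"
                  "\n\n###\n\n",
        "completion": " Hi Sam,\n\nIt was great catching up last week..."
                      " Best,\nAlex END",
    },
]

# Legacy fine-tuning consumes JSONL: one JSON object per line.
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")
```

With 20+ labels per email, each label would become one `key: value` entry in the prompt, so at inference time you could supply the same kind of label block and have the model complete the email.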