Fine-tuning - how exactly does it work?

Hey all,

As I understand it, fine-tuning uses a completely fresh/untrained model rather than extending an existing trained one. Is that correct?

I have some input/output that is sometimes incorrect, but not woefully so. It’s more that the API isn’t adhering to the prompt engineering in some cases (which ultimately does break my application). I was intending to use fine-tuning for this purpose, but I thought it would “integrate” with the existing, already-trained language model (in this case, davinci) to help improve responses. I simply don’t have thousands of data points to train upon.

GPT-4 seems to be much better at it, but it might be quite some time before I have API access to it.

Are my assumptions correct? And can anyone give me any advice?

Hi @Xeen

Fine-tuning, as the name suggests, fine-tunes an existing pre-trained model to improve completions with minimal prompting. Currently only the base models ada, babbage, curie, and davinci support fine-tuning.
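For example, kicking off a fine-tune against the davinci base model looks roughly like this. This is a minimal sketch assuming the legacy pre-1.0 `openai` Python SDK and an already-prepared JSONL training file:

```python
import os
import openai  # legacy pre-1.0 SDK

openai.api_key = os.environ["OPENAI_API_KEY"]

# Upload the prompt-completion training data.
training_file = openai.File.create(
    file=open("training_data.jsonl", "rb"),
    purpose="fine-tune",
)

# Start a fine-tune on top of the pre-trained davinci base model --
# this adjusts existing weights, it does not train from scratch.
fine_tune = openai.FineTune.create(
    training_file=training_file["id"],
    model="davinci",
)
print(fine_tune["id"])
```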

You’ll find more info in the docs.

Hi @sps

Thanks for the response. I’m confused, as the little fine-tuning I did do seems to produce worse results than without the fine-tuning (also at a higher token cost). I assumed this was because I was starting off with a “vanilla” base and then working from that. It may be that I simply don’t have enough data points to fine-tune on. Still, I would think it couldn’t get worse from my own additions.

A good fine-tune requires a sufficient number of prompt-completion pairs, proper formatting, and appropriate hyperparameters.
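For reference, here’s a sketch of preparing data in the prompt-completion JSONL format the fine-tunes endpoint expects. The `\n\n###\n\n` separator and `END` stop sequence follow the conventions in OpenAI’s data-preparation docs rather than being required values, and the examples themselves are hypothetical:

```python
import json

# Hypothetical examples for illustration -- substitute your own pairs.
examples = [
    {"question": "What is the capital of France?", "answer": "Paris"},
    {"question": "What is 2 + 2?", "answer": "4"},
]

with open("training_data.jsonl", "w") as f:
    for ex in examples:
        record = {
            # A fixed separator marks where the prompt ends.
            "prompt": ex["question"] + "\n\n###\n\n",
            # Completions start with a space and end with a stop sequence.
            "completion": " " + ex["answer"] + " END",
        }
        f.write(json.dumps(record) + "\n")
```

You can also run `openai tools fine_tunes.prepare_data -f training_data.jsonl` to have the CLI check and reformat the file for you.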

Can you describe your use case?

The issue with fine-tuning without a lot of data points is that the effects may not show: compared to the original size of the model, the fine-tuning data is minuscule. OpenAI’s guidance suggests performance scales roughly linearly with each doubling of the number of training examples, so a lack of data will really affect performance, especially when starting from base davinci. You might be better off prompt-engineering your task and using the few data points you have as few-shot examples in the prompt.
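For example, a few-shot prompt against the completions endpoint might look like this. It’s a rough sketch using the legacy pre-1.0 `openai` SDK; the model name and the input/output pairs are placeholders, not a recommendation:

```python
import os
import openai  # legacy pre-1.0 SDK

openai.api_key = os.environ["OPENAI_API_KEY"]

# Hypothetical input/output pairs -- substitute your real ones.
few_shot_examples = [
    ("Input: alpha", "Output: ALPHA"),
    ("Input: beta", "Output: BETA"),
]

def build_prompt(new_input: str) -> str:
    # Put the known-good pairs in front of the new query.
    shots = "\n\n".join(f"{i}\n{o}" for i, o in few_shot_examples)
    return f"{shots}\n\nInput: {new_input}\nOutput:"

response = openai.Completion.create(
    model="text-davinci-003",  # placeholder; any completions model works
    prompt=build_prompt("gamma"),
    max_tokens=50,
    temperature=0,
    stop=["\n"],
)
print(response["choices"][0]["text"].strip())
```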

Thanks for your help, guys.

I’ll likely look at redesigning my prompting in this case and maybe splitting it into some smaller tasks.