Fine tuning lower cost models

adamk22 · February 9, 2023, 1:27am

Currently I’m using davinci (latest) to generate long-form completions for an app I have created but it’s eating up funds quite quickly. I’ve been looking into alternative models like curie and babbage but the output is clearly of much lower quality compared to davinci. I’ve also looked into fine-tuning, and I was wondering if I were to fine-tune curie or babbage for my specific use, could I get davinci quality output for my specific need? Obviously, I assume the more training data you feed it, the better, but I was wondering if anyone has any experience with this and can tell me the quality you can achieve with fine-tuned models?

PaulBellow · February 9, 2023, 1:37am

Maybe… but probably not. Too many variables to say for sure, but likely not. Fine-tuning is really hit and miss (depending on a lot of variables…)

I’d recommend a quick search of the forums for similar issues and discussions about fine-tuning.

Hope this helps.

nunodonato · March 14, 2023, 11:34am

I am fine-tuning curie for my project and the results are great! (chatgpt uses the same number of parameters as curie, so don’t underestimate it) - you just need good training data (quality+quantity)
I am now making tests with babbage as well, but I’m guessing that will be no-go…let’s see

Topic		Replies	Views
Fine tuning - how exactly does it work? API	6	1637	December 23, 2023
Fine-Tune Davinci to write programming language API codex	2	835	May 3, 2023
How to reduce the cost of model training with davinci? API api	11	486	January 30, 2024
Fine tunning text-davinci-002 API	2	1089	December 24, 2023
To fine-tune or not to fine-tune? that is the question Prompting	9	2085	April 20, 2024

Fine tuning lower cost models

Related Topics