Fine-tuning Announcement & Davinci Beta

Hi everyone!

As you may have seen, we publicly announced fine-tuning today: Customizing GPT-3 for Your Application.

A couple of updates:

  • Now that we’ve introduced pricing for training, we are lifting the file and training limits. You are free to use any file size and train as much as needed!
  • As of today, we’re offering fine-tuned Davinci as a beta feature.

We’re excited to see what you build!
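
For anyone trying the beta, here is a minimal sketch of kicking off a Davinci fine-tune with the openai Python package as it existed at the time (pre-v1 client); the API key and file name are placeholders:

```python
import openai  # pre-v1 openai-python client

openai.api_key = "sk-..."  # placeholder; use your own API key

# Upload a JSONL training file of {"prompt": ..., "completion": ...} pairs.
upload = openai.File.create(
    file=open("train_data.jsonl", "rb"),  # placeholder file name
    purpose="fine-tune",
)

# Start a fine-tune against the Davinci base model (the new beta).
fine_tune = openai.FineTune.create(
    training_file=upload["id"],
    model="davinci",
)
print(fine_tune["id"])  # poll this job id to track training progress
```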

8 Likes

Hi @luke
From the article, I understood that it is possible to do incremental fine-tunes. Is that correct? So after tuning a model, can we upload a file and train again, “adding” to that already tuned model?
Or did I get it wrong?

You can use an existing dataset of virtually any shape and size, or incrementally add data based on user feedback. With fine-tuning, one API customer was able to increase correct outputs from 83% to 95%. By adding new data from their product each week, another reduced error rates by 50%.

1 Like

Hi there, incremental fine-tuning is not currently available, but we’re exploring adding it in the new year. The customer referenced in the article trained new models with new data.
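
In other words, the current workaround is to retrain from the base model on the combined old and new data. A rough sketch of that loop, assuming prompt/completion JSONL files (the paths are placeholders):

```python
import json
import openai

# Incremental fine-tuning isn't supported, so merge previously used
# examples with newly collected ones and train a fresh model from base.
examples = []
for path in ("old_train.jsonl", "new_feedback.jsonl"):  # placeholder paths
    with open(path) as f:
        examples.extend(json.loads(line) for line in f if line.strip())

with open("combined_train.jsonl", "w") as f:
    for ex in examples:  # each ex is a {"prompt": ..., "completion": ...} pair
        f.write(json.dumps(ex) + "\n")

upload = openai.File.create(file=open("combined_train.jsonl", "rb"), purpose="fine-tune")
openai.FineTune.create(training_file=upload["id"], model="davinci")
```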

2 Likes

Hi @luke
When will davinci-instruct be available for fine-tuning?

1 Like

We’re exploring this. What use case do you have in mind? Or, put another way, why would you prefer it to fine-tuning the base Davinci model?

2 Likes

Not the OP, but I can think of one good reason:
to train the model on a specific API/library (or even a custom-made one it has no knowledge of).

It would probably also be helpful to have it trained on specific code generation in order to save on prompt size. With code, even a few-shot prompt can eat up a lot of tokens quite fast.
Edit: sorry, I misunderstood; I thought this was about davinci-codex.
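
To illustrate the token point with the era's Python client: few-shot examples are re-sent on every request, whereas a fine-tuned model needs only the new input. A sketch (the fine-tuned model name is a made-up placeholder):

```python
import openai

# Few-shot prompting: the demonstrations ride along on every request,
# so prompt tokens grow with each example you include.
few_shot_prompt = (
    "Turn the description into a Python call.\n\n"
    "Description: parse a JSON string\nCall: json.loads(text)\n\n"
    "Description: open a file for reading\nCall: open(path)\n\n"
    "Description: fetch a URL\nCall:"
)
openai.Completion.create(engine="davinci", prompt=few_shot_prompt, max_tokens=20)

# A model fine-tuned on the same pairs only needs the new description,
# saving the repeated example tokens on every call.
openai.Completion.create(
    model="davinci:ft-your-org-2021-12-14",  # placeholder fine-tuned model name
    prompt="Description: fetch a URL\nCall:",
    max_tokens=20,
)
```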

1 Like

Fine-tuned Davinci performs better than fine-tuned Curie on accuracy, and Davinci is superior to Curie. Since Davinci Instruct is superior to Davinci, wouldn’t a fine-tuned Davinci Instruct perform better than a fine-tuned Davinci?

2 Likes

Hi,
We tested both a fine-tuned Davinci base model and Davinci Instruct, and the results from the latter were better on our data. However, we need it to answer questions only from our own corpus, and in the voice of our product. This is why we thought of fine-tuning Davinci Instruct. Is there any other way to achieve this?

1 Like