I’m doing a project where I plan to host a SQL database that stores all the completions I’m using, so they’re viewable to the people I’m sharing the project with. I plan on accumulating many more completions over time and want to re-train the model with batches of new completions. I just plan on exporting data from my SQL database to a JSONL file, but would reusing some of those same completions impact the training at all? Should I make separate JSONL files for each new batch of completions, train a new model with the full set each time, or just retrain the existing model with all the completions each time?
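For context, the export step I have in mind looks roughly like the sketch below. The table name (`completions`), the column names (`prompt`, `completion`) and the use of sqlite3 are just assumptions for illustration; swap in whatever schema and driver you actually use.

```python
import json
import sqlite3


def export_to_jsonl(db_path: str, out_path: str) -> None:
    """Write every stored prompt/completion pair as one JSON object per line,
    which is the JSONL layout the fine-tuning endpoint expects."""
    conn = sqlite3.connect(db_path)
    try:
        # SELECT DISTINCT avoids exporting the exact same pair twice
        rows = conn.execute("SELECT DISTINCT prompt, completion FROM completions")
        with open(out_path, "w", encoding="utf-8") as f:
            for prompt, completion in rows:
                f.write(json.dumps({"prompt": prompt, "completion": completion}) + "\n")
    finally:
        conn.close()


if __name__ == "__main__":
    export_to_jsonl("completions.db", "training_data.jsonl")
```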
I have the same idea and am hoping to get professional advice as well!
You would take a base model. In the future this might be GPT-4, once OpenAI releases a fine-tuning API for that model. So you would essentially start with your own model, which is a “copy” of GPT-4. Let’s call it damp_01. You fine-tune it with all 700 entries in your JSONL file. After some time you will have added another 300 lines and now have a combined set of 1000 entries. You can continue training model damp_01 by providing it with just the 300 new entries.
Later, at some point in the future, OpenAI will release GPT-4.2. Now you create a new model, damp_02, which will essentially be a copy of GPT-4.2. When you begin to fine-tune it, you should provide all 1000 entries and start from scratch.
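With the current fine-tunes endpoint (legacy, pre-1.0 openai Python package), the two cases look roughly like the sketch below. The file names, the model name damp_01 and davinci as the stand-in base model are placeholders, and I’m assuming the endpoint accepts an existing fine-tuned model’s name as the base for continued training, as described above.

```python
import openai  # legacy (pre-1.0) openai package, fine-tunes endpoint

openai.api_key = "sk-..."  # placeholder

# Case 1: continue training the existing fine-tuned model with only the new batch.
new_batch = openai.File.create(file=open("new_300.jsonl", "rb"), purpose="fine-tune")
openai.FineTune.create(
    training_file=new_batch.id,
    model="damp_01",  # placeholder for the actual fine-tuned model ID
)

# Case 2: a newer base model becomes fine-tunable, so start over with the full set.
full_set = openai.File.create(file=open("all_1000.jsonl", "rb"), purpose="fine-tune")
openai.FineTune.create(
    training_file=full_set.id,
    model="davinci",  # replace with the newer base model once it supports fine-tuning
)
```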
According to this OpenAI documentation: https://platform.openai.com/docs/guides/fine-tuning
Fine-tuning is currently only available for the following base models: davinci, curie, babbage, and ada. These are the original models that do not have any instruction following training (like text-davinci-003 does, for example).
Am I missing something?
Hi @SomebodySysop, you are right that fine-tuning is not yet available for GPT-4. That is why I wrote that the base model might be GPT-4 in the future.
Still, we can expect that fine-tuning will one day be available for it, and until then we can already begin to assemble our JSONL files for that purpose.