Fine-tuned model vs last checkpoint

I fine-tuned gpt-3.5-turbo-1106 and now want to fine-tune on top of the fine-tuned model.
When selecting a base model, I can choose either one of the checkpoints or the fine-tuned model itself. When I select the last checkpoint as the base model, it says it isn't available, but selecting the fine-tuned model as the base model works fine. So what is the difference between the fine-tuned model and its last checkpoint? Is it better to start fine-tuning from the fine-tuned model or from its last checkpoint, or are they the same thing?


OpenAI says:

OpenAI will create one full model checkpoint for you at the end of each training epoch

You can check, and report back, whether there is a checkpoint that duplicates the final state (see the sketch after the steps below):

  1. List checkpoints - get the count of checkpoints
  2. List fine-tuning job objects - get the n_epochs hyperparameter of the job

If they match, then the last checkpoint is the same as the full model.

If the checkpoint count is one less, then there is no checkpoint corresponding to the final model.
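
A minimal sketch of that check with the openai Python library (v1.x); the job ID is hypothetical, and `n_epochs` is read from the completed job's resolved hyperparameters:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

job_id = "ftjob-abc123"  # hypothetical fine-tuning job ID

# 1. List the job's checkpoints and count them
checkpoints = client.fine_tuning.jobs.checkpoints.list(job_id)
checkpoint_count = len(checkpoints.data)

# 2. Retrieve the job and read its n_epochs hyperparameter
job = client.fine_tuning.jobs.retrieve(job_id)
n_epochs = job.hyperparameters.n_epochs

print(f"checkpoints: {checkpoint_count}, epochs: {n_epochs}")
if checkpoint_count == n_epochs:
    print("The last checkpoint should correspond to the final fine-tuned model.")
else:
    print("No checkpoint corresponds to the final model state.")
```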

In either case, the option you have for continuation is: “you can pass the name of a fine-tuned model into the model parameter when creating a fine-tuning job.” It is unlikely you’d want to start from an earlier checkpoint and continue if that were available, unless you are training on a whole new set of examples that will be activated separately.
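
In practice, continuation is just an ordinary fine-tuning job whose `model` parameter is the fine-tuned model's full name. A short sketch (the model name and training file ID are hypothetical placeholders):

```python
from openai import OpenAI

client = OpenAI()

# Continue training from the previously fine-tuned model by passing its
# full name as the base model for a new fine-tuning job.
job = client.fine_tuning.jobs.create(
    model="ft:gpt-3.5-turbo-1106:my-org::abc123",  # hypothetical fine-tuned model
    training_file="file-xyz789",                   # hypothetical new training file
)
print(job.id, job.status)
```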
