Where can I find information about validation_file used in fine tuning?

tregoning · October 25, 2023, 4:14pm

I am trying to find documentation/examples of what should go in validation_file used when creating a fine tuned model.

The API docs point me to the fine-tuning-guide: OpenAI Platform

The fine-tuning guide points me to the API

Any pointers would be greatly appreciated, I am using ‘gpt-3.5-turbo’ and chat-complitions format in the training data

Thanks!

_j · October 25, 2023, 4:51pm

The validation file should be the same type of example conversations as you are training on, in the same type of file format.

The held-out examples should be of the quality where you could shuffle all your questions randomly and put any 10% of them into a validation file.

The validation file lets you see a second benchmark produced during fine-tune: not just how much the learning on the training set has progressed, but how well similar questions are inferred.

There can be a point of over-training or over-specialization where the AI no longer works as well on those similar questions it has not seen before, by being fine tuned to write only what you gave it.

Topic		Replies	Views
Understanding Validation file and Fine tuning file API gpt-4 , gpt-35-turbo , chatgpt , api	3	2097	November 21, 2023
What is validation_file for? API fine-tuning	8	1206	April 17, 2024
What's the point of providing a validation file for fine-tuning? API fine-tuning	4	3264	October 21, 2023
What's the best train/validate split for fine-tuning? API gpt-35-turbo , fine-tuning , api	3	758	November 25, 2023
"validation_file" in Create Fine-Tuning Job API gpt-35-turbo , fine-tuning	9	1847	October 9, 2023

Where can I find information about validation_file used in fine tuning?

Related Topics