A validation file is examples of the same quality as your training that are held out.
At the end of batches during fine-tuning, the generation is run on the trained model vs the validation examples to find the deviation by token scoring.
One can chart the progress to find where the AI model has become optimized for the full breadth of questions, while not becoming over-trained on specifically the inputs and outputs that are just in the training set.