Performance metrics of the fine-tuned model

Hello everyone!
I recently started fine-tuning gpt-3.5-turbo-1106 for my academic project. I am having trouble generating performance metrics for the fine-tuned model, such as accuracy, precision, recall, and F1 score.
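To make the question concrete, here is a minimal sketch of the metrics I mean. It assumes the fine-tuned model is used as a classifier and that its predicted labels for a held-out test set have already been collected into a list. The names `classification_metrics`, `y_true`, and `y_pred` are hypothetical and used only for illustration, and macro-averaging is just one possible choice.

```python
from collections import Counter


def classification_metrics(y_true, y_pred):
    """Return accuracy plus macro-averaged precision, recall, and F1.

    y_true: reference labels from the held-out test set (assumed available).
    y_pred: labels parsed from the model's responses (assumed available).
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("need at least one example")

    # Count true positives, false positives, and false negatives per label.
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1
            fn[truth] += 1

    labels = set(y_true) | set(y_pred)
    precisions, recalls, f1s = [], [], []
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        p = tp[label] / predicted if predicted else 0.0
        r = tp[label] / actual if actual else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f)

    n = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n,  # macro average over labels
        "recall": sum(recalls) / n,        # macro average over labels
        "f1": sum(f1s) / n,                # macro average over labels
    }


if __name__ == "__main__":
    # Toy labels, made up purely to show the call.
    truth = ["pos", "neg", "pos", "neg"]
    preds = ["pos", "neg", "neg", "neg"]
    print(classification_metrics(truth, preds))
```

If the task is not classification (for example free-text generation), these four metrics would not apply directly and a different evaluation would be needed.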

Can anyone help me with this?