Why are "Training loss" and "Validation loss" so high

Hi there!

It looks like you are intending to use fine-tuning for Q&A. Unfortunately, that is not what fine-tuning is designed for. In this context, fine-tuning is primarily a way to get a model to behave in a certain way, e.g. to adopt a certain output style or to approach a task in a specific manner. It is not a recommended way to get the model to absorb specific knowledge.

As a similar topic came up just a few days ago, I’ll refer you to another thread where I have discussed this at greater length along with references to other resources.

Bottom line: you should be looking at a retrieval-augmented generation (RAG) approach for your use case.
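To make the idea concrete, here is a minimal sketch of the RAG pattern: embed your documents, retrieve the ones most similar to the question, and put them into the prompt instead of trying to bake them into the model's weights. The toy bag-of-words `embed` function below is a stand-in I made up for illustration; a real system would use a proper embedding model and a vector store.

```python
from collections import Counter
import math

def embed(text):
    # Toy bag-of-words "embedding" for illustration only;
    # a real pipeline would call an embedding model instead.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

docs = [
    "Fine-tuning adjusts a model's style and behaviour.",
    "Retrieval augmented generation injects relevant documents into the prompt.",
    "Embeddings map text to vectors for similarity search.",
]

def retrieve(question, k=2):
    # Rank documents by similarity to the question, keep the top k.
    q = embed(question)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

question = "How does retrieval augmented generation work?"
context = "\n".join(retrieve(question))
# The retrieved context is prepended to the prompt sent to the model:
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
```

The key point is that the knowledge lives in your document store, not in the fine-tuned weights, so you can update it without retraining anything.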

Let us know if you have any follow-up questions once you’ve had a chance to review the other thread.

Good luck in any case.
