Fine-tuned model unable to answer prompts from training data

Hello,

I’ve followed the tutorial about fine-tuning and was surprised that the model (based on davinci) could not correctly answer the questions present in the jsonl file.

For instance, the jsonl file contains:
{"prompt": "What is the color of the XXX heat pump ?", "completion": " Blue\n"}

But the response given by the model to the very same question is “It is black”.

It seems that the model is drawing on outside knowledge. How can it be made to prioritize the training data?

Regards,
Kevin

Fine-tuning teaches the model how to answer.

Embeddings provide a way for models to access information outside their initial training data.
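
To make that concrete, here is a minimal sketch of the embeddings approach, assuming the legacy (pre-1.0) openai Python SDK, an OPENAI_API_KEY in the environment, and a small placeholder corpus (document contents and model names are illustrative, not from this thread):

```python
import numpy as np
import openai

# Placeholder corpus of domain snippets; in practice these would be
# chunks of your heat-pump documentation.
documents = [
    "The XXX heat pump has a blue casing and a 5 kW output.",
    "The YYY heat pump has a black casing and uses R32 refrigerant.",
]

def embed(text):
    # Legacy (pre-1.0) SDK call; reads OPENAI_API_KEY from the environment.
    resp = openai.Embedding.create(model="text-embedding-ada-002", input=text)
    return np.array(resp["data"][0]["embedding"])

doc_vectors = [embed(d) for d in documents]

question = "What is the color of the XXX heat pump ?"
q_vec = embed(question)

# Pick the document closest to the question (cosine similarity).
scores = [v @ q_vec / (np.linalg.norm(v) * np.linalg.norm(q_vec)) for v in doc_vectors]
context = documents[int(np.argmax(scores))]

# Answer from the retrieved context instead of relying on fine-tuned weights.
answer = openai.Completion.create(
    model="davinci",
    prompt=f"Context: {context}\n\nQuestion: {question}\nAnswer:",
    max_tokens=20,
    temperature=0,
    stop=["\n"],
)["choices"][0]["text"]
print(answer)
```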

Hi Kevin, how many epochs did you use? I found in some cases that if I increase the epochs, the model starts referring to the training completions.

Are you alluding to the “embeddings” approach, where the prompt is matched against different corpora of text? The problem with that approach is that the answer to the prompt has to be found within the selected corpus, while some prompts may span several different corpora.

I only ran the tutorial, and the number of epochs was 4 by default. I’ll check how it can be modified.
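
For reference, with the legacy fine-tunes API the epoch count is controlled by the n_epochs hyperparameter; a minimal sketch assuming the pre-1.0 openai Python SDK (the file name is a placeholder):

```python
import openai

# Upload the training file (placeholder name), then launch a fine-tune
# with an explicit epoch count instead of the default of 4.
train_file = openai.File.create(
    file=open("heat_pump_qa.jsonl", "rb"),
    purpose="fine-tune",
)

openai.FineTune.create(
    training_file=train_file["id"],
    model="davinci",
    n_epochs=50,
)
```

The CLI used in the tutorial exposes the same setting, as an --n_epochs flag if I recall correctly.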

How are you training your data? i.e. how are you training the model to know that the XXX heat pump is blue?
If it’s just from the Q&A samples alone, it probably won’t help - as @elmtedt noted, you’re essentially just training the model to respond a certain way rather than “understand” why the XXX heat pump is blue.

If you want to fine-tune the model to understand different heat pumps and their properties (colors, etc) I think there are two options:

  1. If the data you want to train on is concise enough, put it into each training sample prompt. For example, you could start each prompt with a list of heat pumps and their properties. Then the Q&A training would be able to refer directly to that data and associate the answers with it. The downside to this approach is that the latest fine-tunable models (e.g. davinci) only allow up to 2048 tokens for prompt+completion, which may not be enough to fit all of your domain knowledge.
  2. If you need to draw from a larger corpus of text about the subject matter (heat pumps), then you should try the sequential fine-tuning approach outlined in OpenAI’s draft guide. That is, first train with unstructured documents that describe your domain knowledge about heat pumps. Then run another fine-tune job on that pre-trained model with the Q&A pairs. This should “teach” the model to associate the XXX heat pump in the questions with the XXX heat pump it learned about from the unstructured data. (Both sample formats are sketched below.)
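
To make the two options concrete, the training samples could look roughly like this (a hedged sketch; the heat-pump facts are placeholders):

Option 1, domain facts embedded in each prompt:
{"prompt": "Heat pump properties:\n- XXX: blue, 5 kW\n- YYY: black, 8 kW\n\nWhat is the color of the XXX heat pump ?", "completion": " Blue\n"}

Option 2, a first pass over unstructured documents (empty prompt), followed by a second fine-tune with the Q&A pairs:
{"prompt": "", "completion": " The XXX heat pump is a residential unit with a blue casing and a 5 kW output.\n"}
{"prompt": "What is the color of the XXX heat pump ?", "completion": " Blue\n"}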

I just followed the tutorial about fine-tuning models, so I just used a jsonl file as explained in my post. My intent was not to make the model understand why the heat pump is blue, just to have it learn by heart that it is blue. Following @joyasree78’s advice, I increased the number of epochs to 50, and the model now gives the right answers, followed by gobbledygook. I still need to investigate that.
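
On the gobbledygook: the training completions end with "\n", and the same stop sequence normally has to be passed at inference time so generation halts after the answer. A minimal sketch with the pre-1.0 openai Python SDK (the fine-tuned model ID is a placeholder):

```python
import openai

response = openai.Completion.create(
    model="davinci:ft-your-org-2023-01-01-00-00-00",  # placeholder fine-tuned model ID
    prompt="What is the color of the XXX heat pump ?",
    max_tokens=10,
    temperature=0,
    stop=["\n"],  # matches the "\n" that ends every training completion
)
print(response["choices"][0]["text"])
```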

The sequential fine-tuning approach looks interesting; I guess that learning from unstructured documents would allow the model to pick up the business language. But how can you train a model on such data? I haven’t found any section about model training in the OpenAI guides.

Look into Hypothetical Document Embeddings (HyDE).

I don’t see how that proposal (generating a hallucinated document to generate an embedding to then do embedding based retrieval) would help with the original problem of “fine tuning a few steps doesn’t losslessly encode knowledge into the model.” It looks to me like a totally different mechanism, solving a totally different problem?
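
For readers who haven’t seen it, the core of HyDE is just the following (a hedged sketch, pre-1.0 openai SDK; the resulting vector would replace the question embedding in an ordinary retrieval pipeline like the one sketched earlier in the thread):

```python
import openai

question = "What is the color of the XXX heat pump ?"

# HyDE step 1: let the model write a plausible (possibly hallucinated)
# passage that answers the question.
hypothetical = openai.Completion.create(
    model="davinci",
    prompt=f"Write a short passage answering the question: {question}\n\nPassage:",
    max_tokens=100,
)["choices"][0]["text"]

# HyDE step 2: embed the hallucinated passage instead of the raw question,
# and use this vector for the nearest-neighbour search over the real corpus.
hyde_vector = openai.Embedding.create(
    model="text-embedding-ada-002",
    input=hypothetical,
)["data"][0]["embedding"]
```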

Yes, if you increase the epochs it will increase the accuracy of the model. Make sure the number of epochs is above 8.
