I am getting unrelated answers from my fine-tuned model!

I tested a set of conversations (prompts, user inputs, and assistant messages) in the Playground, and the model responded as expected. Then, after creating a JSONL file based on the documentation and fine-tuning a model, the new model’s responses seem uninformed, as if it lacks knowledge of the prompts and all the examples provided.

Any ideas on what might be causing this?

Your problem is “creating a JSONL file based on the documentation” and expecting that to instill understanding of the documentation.

An AI language model produces a completion by predicting an output from an input, with its pretraining (and OpenAI’s post-training) shaping how patterns are expected to be followed.

Send the exact same system message and user message as constructed in the lines of your fine-tune file, and you should then receive an assistant response like the one the fine-tune file had the assistant producing. If you do not use the inference input the same way as you trained, or you did not train on exactly how the AI is expected to be used (including growing conversations as examples), then you cannot expect to receive the desired behavior.
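
As a rough sketch of what “use it the same way you trained it” means (the system text, user text, and fine-tuned model id below are placeholders, not taken from your file): one line of the JSONL training data, and an inference call that reuses the identical system message so the input matches what the model was trained on.

```python
# One line of the fine-tune JSONL file (placeholder content):
# {"messages": [
#   {"role": "system", "content": "You are the Acme support assistant."},
#   {"role": "user", "content": "How do I reset my password?"},
#   {"role": "assistant", "content": "Open Settings > Security and choose 'Reset password'."}]}

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo:my-org::abc123",  # placeholder fine-tuned model id
    messages=[
        # identical system message to the one used in training
        {"role": "system", "content": "You are the Acme support assistant."},
        {"role": "user", "content": "How do I reset my password?"},
    ],
)
print(response.choices[0].message.content)
```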

You will also find on the forum that the correct way to answer from documentation is to use an external retrieval solution; new knowledge is not the kind of thing a fine-tune will be good for (unless you can produce many thousands of example completions on that knowledge).
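
For illustration, a minimal retrieval sketch with made-up documentation chunks and placeholder model names: embed the documentation, pick the chunk closest to the question, and place that chunk in the prompt so the model answers from supplied text rather than from fine-tuned “memory”.

```python
import numpy as np
from openai import OpenAI

client = OpenAI()

# Placeholder documentation chunks
doc_chunks = [
    "Passwords can be reset from Settings > Security.",
    "Invoices are emailed on the first business day of each month.",
]

def embed(texts):
    resp = client.embeddings.create(model="text-embedding-3-small", input=texts)
    return np.array([d.embedding for d in resp.data])

chunk_vectors = embed(doc_chunks)

def answer(question):
    q_vec = embed([question])[0]
    # cosine similarity against each chunk, take the best match
    sims = chunk_vectors @ q_vec / (
        np.linalg.norm(chunk_vectors, axis=1) * np.linalg.norm(q_vec)
    )
    best_chunk = doc_chunks[int(np.argmax(sims))]
    resp = client.chat.completions.create(
        model="gpt-4o-mini",  # a base model works here; no fine-tune required
        messages=[
            {"role": "system", "content": "Answer only from the provided documentation."},
            {"role": "user", "content": f"Documentation:\n{best_chunk}\n\nQuestion: {question}"},
        ],
    )
    return resp.choices[0].message.content

print(answer("How do I reset my password?"))
```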
