Hi . For a few days I’ve been doing some fine-tuning of GPT 3.5. I am trying to teach it some new concepts about Machinations (an economy design tool). By default, GPT 3.5 has some rather unclear answers regarding how to do certain things in Machinations.
My worst “nightmare” is that the model thinks that Machinations has an Unreal Engine plugin. Machinations DOES have a plugin for Unity 3D, but not for Unreal Engine. And I’m trying to teach the model this basic fact. + a bunch of others where it gives unclear/vague answers (which is expected because Machinations is rather niche - although it DOES know about it and has some decent answers by default).
So, I generated a relatively good set of data for training it, based on the Machinations manual. The data is of course in JSONL. The largest set sits at 1533 examples, which I think is a fairly good chunk of data.
Unfortunately, even after 20 epochs of training, and 2,487,300 tokens, the model still gets wrong a very basic notion that was actually repeated a few times (with different formulations) in the training data.
I ask it if Machinations has an Unreal Engine plugin and the answer is almost always “yes”.
This is how my 20 epochs training looks like. Maybe I’m missing something?
Then, I even tried a fascinating 32 epochs but this time with a smaller file, with just 15 examples in which I basically repeat to it like a mad parrot that THERE IS NO UNREAL ENGINE PLUGIN.
Even after that, it continues hallucinating that there IS, plus, after the 32 epochs training many of the other answers go haywire (like, worse hallucinations).
This is how the smaller file, 32 epochs training looks like:
I would really appreciate some guidance here before I sink more money into this. Spent about $100 so far.