Fine tuned model doesn't perform on production environment

cuitingzhao · August 29, 2023, 3:56am

I just fine-tuned the 3.5-turbo-0613 yesterday, and see great improvement when I used the fine-tuned model on the playground and call from my local environment. But when I deployed it into our live production, it didn’t perform at all. The reason I fine-tuned is to give it a different tone to make it speak more concisely and orally which I can’t achieve by prompt engineering. So back to the question, has anyone met the same issue? How did you resolve it? Many thanks in advance!

_j · August 29, 2023, 4:02am

Are you giving your AI the same system prompt as your fine tuning in API also?

Have you made chatbot software that continues to feed previous conversation back as user/assistant roles so the AI knows what you were previously talking about?

cuitingzhao · August 29, 2023, 4:18am

Thanks for your reply! Yes I gave the same the system prompt as my fine tuning in API. And I did feed the previous 6 conversations so AI knows what I previously talked about.

_j · August 29, 2023, 4:23am

Are you getting any evidence that your fine-tune model is actually used? Forgetting to specify the model or it being used wrong could give an AI that only follows the system prompt. The full API response includes the model used.

Replicating all playground parameters at temperature 0.1 and top_p 0.1 should get you nearly identical responses. “Generate code” in playground gives a python example with the parameters.

And if it wasn’t clear: you must make a system role message that is permanently inserted as the first to replicate the “system” box of the playground, and will invoke the behaviour you have tuned.

cuitingzhao · August 29, 2023, 4:44am

Yes the difference of tone of reply to the same question between playgroud/local and production is very obvious so I think it’s evident enough.
I can also ensure all the parameters are the same. System message is also the same.

PaulBellow · August 29, 2023, 4:50am

Feel free to post some code, and we can take a look.

Examples are helpful to root out possible problems…

cuitingzhao · August 29, 2023, 6:20am

Thank you everyone! By looking at the log I realized there is some extra message attached to the system prompt
Sorry to take your time answering. I will be more careful next time!

Topic		Replies	Views
Finetuned a model, but it replies like insane API	7	1172	December 24, 2023
What the theory of GPT Finetune? The result looks not so good API chatgpt	4	855	October 23, 2023
Fine tuned model produces responses that make it seem like it hasn't been fine tuned at all API fine-tuning , fine-tuning-problems	1	1511	September 14, 2023
Fine tuned with wrong data initially API fine-tuning-problems	11	1458	December 23, 2023
Hallucination after fine tuning API api	5	265	August 13, 2024

Fine tuned model doesn't perform on production environment

Related topics