Fine-Tuning Model Performance – Seeking Solutions

We built a persona-driven AI chatbot that delivers personalized Vedic astrology readings by processing users’ birth chart data via an API.

Why We Attempted Fine-Tuning

With fine-tuning, we aimed for:

  • Simpler, More Human Language – Making responses warm, engaging, and easy to understand.
  • Conversational Variability – Reducing repetition and ensuring a more dynamic, natural flow.
  • Concise Output – Keeping responses brief and impactful.

Our Approach

  • Collected real user chat data.
  • Manually refined responses to match our desired tone and style.
  • Fine-tuned the model on this improved dataset (a sketch of one training sample is shown below).
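
For reference, this is roughly what one record in our training file looked like. A minimal sketch assuming the OpenAI-style chat JSONL fine-tuning format; the file name and message contents are hypothetical placeholders, not our actual data:

```python
import json

# One training record in OpenAI-style chat JSONL: the system prompt sets the
# persona, and the assistant turn is a manually refined response. All message
# contents here are hypothetical placeholders.
record = {
    "messages": [
        {"role": "system", "content": "You are a warm, concise Vedic astrology guide."},
        {"role": "user", "content": "What does my Moon in Scorpio say about relationships?"},
        {"role": "assistant", "content": (
            "Your Scorpio Moon runs deep. You love with intensity and loyalty, "
            "and you need a partner who isn't scared of that depth."
        )},
    ]
}

# Each record is written as one line of JSONL, the input format most chat
# fine-tuning endpoints expect.
with open("finetune_train.jsonl", "a", encoding="utf-8") as f:
    f.write(json.dumps(record, ensure_ascii=False) + "\n")
```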

Unexpected Fine-Tuning Issues

  • Worsened Performance – The fine-tuned model performed worse than the original system prompt version.
  • Language & Tone Issues – Responses became unnatural, erratic, and sometimes incoherent.
  • Overall Degradation – None of the three target improvements materialized; across the board, the fine-tuned model underperformed the prompt-only baseline.

Looking for Insights

  • Has anyone faced similar degradation when fine-tuning with user chat data?
  • What alternative strategies (e.g., refined prompt engineering, reinforcement learning, or hybrid approaches) could improve chatbot responses while maintaining the strengths of the system prompt model? One prompt-engineering direction we are weighing is sketched below.
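
To make the prompt-engineering option concrete: keep the base model and encode the three original goals as explicit style rules in the system prompt. A rough sketch assuming the OpenAI Python SDK; the prompt wording, model name, and temperature are illustrative guesses, not a tested recipe:

```python
from openai import OpenAI

client = OpenAI()

# Hypothetical refined system prompt encoding the three goals:
# human language, conversational variability, and concise output.
SYSTEM_PROMPT = (
    "You are a warm, friendly Vedic astrology guide.\n"
    "Style rules:\n"
    "- Use simple, everyday words; briefly explain any Sanskrit term you use.\n"
    "- Vary your openings and sentence structure; never reuse stock phrases.\n"
    "- Keep replies under roughly 120 words unless the user asks for more depth."
)

response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model choice
    temperature=0.9,      # nudged higher to encourage varied phrasing
    messages=[
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": "What does Saturn in my 7th house mean?"},
    ],
)
print(response.choices[0].message.content)
```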

Did you still include your system prompt in the fine-tuning samples? Or did you try to fine-tune on pure user inputs and model outputs?
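
To make that distinction concrete, the two dataset shapes being contrasted would look like this in chat-format training data (a sketch; the contents are placeholders):

```python
# Variant A: the production system prompt is baked into every training sample,
# so the fine-tuned behavior is learned in the same context it will be served in.
with_system_prompt = {
    "messages": [
        {"role": "system", "content": "You are a warm, concise Vedic astrology guide."},
        {"role": "user", "content": "<real user message>"},
        {"role": "assistant", "content": "<manually refined response>"},
    ]
}

# Variant B: bare user/assistant pairs. If the production system prompt is then
# re-attached at inference time, the model runs in a context it never saw during
# training, a mismatch that can plausibly contribute to erratic tone.
without_system_prompt = {
    "messages": [
        {"role": "user", "content": "<real user message>"},
        {"role": "assistant", "content": "<manually refined response>"},
    ]
}
```

If the training data looked like Variant B but the bot is served with the full production prompt, that train/serve mismatch alone could account for some of the degradation you saw.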