Are there any exemplary datasets out there that have produced great results?
I have trouble with my model overfitting to examples (e.g. regurgitating examples from the prompt verbatim), losing all quality over long chat histories, and falling into infinite repetition at times.