Best Public OpenAI FineTune Datasets? e.g. Incorporating RAG & Relevant Info

Any exemplary datasets out there that have obtained great results?

I have trouble with my model overfitting to examples (e.g. regurgitating examples from prompt verbatim), loosing all quality over long chat history, and infinite repetition at times.