I’m using gpt-4o + RAG to generate documents as they’re made in my company. It’s working well but I need it to achieve better results, that’s why I’m finetuning it and these are the kind of entries I’m writing in my .jsonl:
When model replied correctly with a well made document:
{“role”: “user”, “content”: “Make me a document within this info”},{“role”: “assistant”, “content”: “Right document”}
When my model fails:
{"role": "user", "content": "Make me a document within this info ..."},{"role": "assistant", "content": "Wrong document"},{"role": "user", "content": "You failed in the following points: .... Make it again"},{"role": "assistant", "content": "Right document"}
Is this how I’m supposed to do it?