I have a simple question about where to put each information in the structure of the dataset.
Let’s take the example of a dataset to remove the id and class from a html string.
What I am doing :
{"messages":[{"role":"system","content":"You are a html expert"},{"role":"user","content":"Remove all the class and id from the following html: here html string with class and id"},{"role":"assistant","here html string without class and id"}]}
It’s working when the task is easy. But, I wonder if the “prompt instruction” must be in the role system content instead :
{"messages":[{"role":"system","content":"You are a html expert. Remove all the class and id from the html"},{"role":"user","content":"here html string with class and id"},{"role":"assistant","here html string without class and id"}]}
I have trained several models and having great results but for a difficult task with 100 great examples, it does not work that great. So may be the second example is what I need to do.
Any advice would be much appreciated !
Thanks !