I want to train a model so that it output specific JSON values in the correct format relative to user inputs. I’ve tried including instructions as the assistant output to my messages, but its clear now that that isn’t how fine tuning is designed to work. The reply from the assistant is simply my instructions formatted as JSON, it did no replace any of the instructions with actual content.
For example:
{"role": "system","content": "Return me a random Colour & Object in JSON format"},{"role": "user","content": "Tell me a random colour and object"},{"role": "assistant","content": "\"JSON\"{\"value1\":{\"content\":\"tell me a random colour\"},\"value2\":{\"content\":\"tell me a random object\"}}"}
Returns this:
"JSON"{
"value1":{"content":"tell me a random colour"},
"value2":{"content":"tell me a random object"}
}
I want to to return a random item without me having to define it:
"JSON"{
"value1":{"content":"Blue"},
"value2":{"content":"Car"}
}
Most of the examples ive seen are based on content output as text rather than any specific format.
Is there a better way to train models to focus on learning only the appropriate format of outputs and then to creating the content itself, rather than it following any specific content that i include within the JSON format?