I am finetuning gpt3.5 model for meal search app in a conversational way, such as it should have a context of what I am talking about.
My question is, does the open ai api train all the assistant responses from message item as output training example or it just took the last assistant response to train the output, while putting all the previous responses as input context?
For example all the conversation in a single message item.
jsonl format
{"messages":[
{"role":"user","content":"i would like to eat biryani"},
{"role":"assistant","content":"Do you have a specific type of biryani in mind or do you want me to search for the general biryani item?"},
{"role":"user","content":"all items"},
{"role":"assistant","content":"sure let me do this", "function_call":...},
{"role":"function","content":"tool call response ... "},
{"role":"assistant","content":"Here are some of the delighted biryani items including ..... do you want to try chinese items as they are also rice base option??"}
]}
for seconds case:
{"messages":[
{"role":"user","content":"i would like to eat biryani"},
{"role":"assistant","content":"Do you have a specific type of biryani in mind or do you want me to search for the general biryani item?"}
]}
{"messages":[
{"role":"user","content":"i would like to eat biryani"},
{"role":"assistant","content":"Do you have a specific type of biryani in mind or do you want me to search for the general biryani item?"},
{"role":"user","content":"all items"},
{"role":"assistant","content":"sure let me do this", "function_call":...}
]}
{"messages":[
{"role":"user","content":"i would like to eat biryani"},
{"role":"assistant","content":"Do you have a specific type of biryani in mind or do you want me to search for the general biryani item?"},
{"role":"user","content":"all items"},
{"role":"assistant","content":"sure let me do this", "function_call":...},
{"role":"function","content":"tool call response ... "},
{"role":"assistant","content":"Here are some of the delighted biryani items including ..... do you want to try chinese items as they are also rice base option??"}
]}
So which format is better for a model to train and understand the context better and respond accordingly?