After many tries, I found out that part of the problem was caused by concatenation of phrases after punctuation (for a bug of my dataset generation script).
After that correction and the adding of #### stop sequence I got a lower error rate for the double phrases, but those still happens sometimes
For reference here a line of dataset without system prompt inside
"messages": [
{
"role": "system",
"content": ""
},
{
"role": "assistant",
"content": "Hello! I'm the assistant from Osteria Del Alberto, how may I assist you?########"
},
{
"role": "user",
"content": "hi i would like to order some pizzas for delivery for tomorrow evening"
},
{
"role": "assistant",
"content": "Wonderful! I can proceed with your order, may I have your address, please?########"
}
],
"functions": [
{
"name": "delivery_address",
"parameters": {
"type": "object",
"required": [
"address"
],
"properties": {
"address": {
"type": "object",
"required": [
"street",
"number",
"city"
],
"properties": {
"city": {
"type": "string",
"description": "City name"
},
"number": {
"type": "string",
"description": "Street number"
},
"street": {
"type": "string",
"description": "Street name"
}
},
"description": "Delivery address"
}
}
},
"description": "Delivery address"
},
{
"name": "takeaway_change",
"parameters": {
"type": "object",
"required": [
"change"
],
"properties": {
"change": {
"type": "boolean",
"description": "Change from delivery to takeaway"
}
}
},
"description": "Change request from customer to do a takeway order, ask for confirmation before calling the function"
},
{
"name": "reservation_cancel",
"parameters": {
"type": "object",
"required": [
"annulla"
],
"properties": {
"annulla": {
"type": "boolean",
"description": ""
}
}
},
"description": "Previous done reservation change or cancellation request from the customer"
}
]
}