I’m designing an interactive game where Assistant initiates conversations with a human player.
There are three different methods to establish the communication rules and conversation sequence:
-
System Prompt Initiation: The rules are explained in the system prompt, and at the end of this prompt, it instructs the assistant to start the conversation.
-
User Prompt Initiation: The rules are explained in the first user prompt, following which the assistant is expected to start the conversation.
-
Delayed Assistant Start: The rules are given in the system prompt, and it is specified that the user will send the message “start” to signal the assistant to begin the conversation.
My concern is about the potential differences in performance among these configurations.
Specifically, I am worried that there might be less training data available for sequences that start with the system prompt followed by the assistant’s response (i.e., system - assistant - user - assistant - user…), compared to sequences that begin with the system prompt followed by the user’s first query (i.e., system - user - assistant - user - assistant…).
This difference in data availability might affect the alignment and performance of the chatbot. Is my concern reasonable?