Recently, I came across a doubt regarding which role should be used for the “Context” message in a query sent to ChatCompletion API.
Since the very beginning, I was following OpenAI Cookbooks and till today used the “user” role for this message, e.g.
query = [
{“role”: Role.SYSTEM, “content”: prompt},
{“role”: Role.USER, “content”: context},
{“role”: Role.USER, “content”: question},
]
A few days ago just out of curiosity, I asked ChatGPT this question and it came up with the role “assistant”. The reasoning it provided was very sound (you can test yourselves) and I switched my test environment to use “assistant” instead. And so far I could not spot any decrease in answers’ quality.
Questions to OpenAI folks:
which role would you suggest and why (besides what’s in cookbooks)?
please kindly explain why is there a discrepancy between cookbooks and ChatGPT suggestions (also just curiosity :)?
Subject to its rules, the Model Spec explicitly delegates all remaining power to the developer (for API use cases) and end user. In some cases, the user and developer will provide conflicting instructions; in such cases, the developer message should take precedence. Here is the default ordering of priorities, based on the role of the message:
Platform > Developer > User > Tool
Note: Though not included here the assistant role is assumed to be about on par with Tool.
So, depending on how much you want the model to adhere to the retrieval as fact should dictate what role you assign it to.
I want model to NOT perceive content of context data as something provided by user, because it is not user’s input but the attempt of the most appropriate documentation I could match using embeddings to help model answer the question. In my case, data in context message contains both - KB chunks (documentation/tutorials) and some examples of similar (user-agent) conversations about the same/similar topics.
I wish context content is treated as a fact hence this should not be user message, right? Then on other hand, system messages are not meant for large context amounts!? Therefore, the assistant role seems the most logical choice. What I’m missing?
Thank you! Indeed, there is no separate limitation for length of system messages. Changed roles to system for all non-user content and my fidelity tests are passing now even for questions having mixed results previously.
Are there known drawbacks of having multiple system messages within a single query?