I am trying to construct a fine-tuning dataset for GPT-3.
The example structure for messages given in the documentation is:
{"messages": [{"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."}, {"role": "user", "content": "What's the capital of France?"}, {"role": "assistant", "content": "Paris, as if everyone doesn't know that already."}]}
What exact content needs to go in the "system" entry? The example of "Marv is a factual chatbot that is also sarcastic." does not provide any useful guidance.
Is this supposed to contain my entire set of instructions for the assistant, just a summary of those instructions, or something else? It is unclear what content is needed here and how it is used during fine-tuning.
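For context, this is roughly how I am assembling the file right now (a minimal sketch in Python; the file name and example pairs are just placeholders):

```python
# Minimal sketch: writing a fine-tuning file in the chat format,
# reusing one short system message for every example.
# "marv_finetune.jsonl" and the example pairs are placeholders.
import json

SYSTEM_MESSAGE = "Marv is a factual chatbot that is also sarcastic."

# Hypothetical input/output pairs the model should learn from.
pairs = [
    ("What's the capital of France?",
     "Paris, as if everyone doesn't know that already."),
    ("Who wrote 'Romeo and Juliet'?",
     "Oh, just some guy named William Shakespeare. Ever heard of him?"),
]

with open("marv_finetune.jsonl", "w", encoding="utf-8") as f:
    for user_msg, assistant_msg in pairs:
        example = {
            "messages": [
                {"role": "system", "content": SYSTEM_MESSAGE},
                {"role": "user", "content": user_msg},
                {"role": "assistant", "content": assistant_msg},
            ]
        }
        f.write(json.dumps(example) + "\n")
```

I just don't know what the system message in each record should actually say.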
As I understand it, the system message is meant to set the behavior the assistant should exhibit.
There's no need for full instructions. Put simply, one of the reasons for fine-tuning is to save on instruction (prompt) tokens and go straight from input data to the desired output.
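To illustrate the token saving: once the model is fine-tuned, a request can carry the same short system message used in training instead of a long instruction block. A rough sketch with the openai Python package (the fine-tuned model name is a placeholder):

```python
# Sketch only: calling a fine-tuned model with the same short system
# message used in training, rather than a long instruction prompt.
# The model name below is a placeholder for your own fine-tuned model ID.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="ft:gpt-3.5-turbo:my-org::abc123",  # hypothetical fine-tuned model
    messages=[
        {"role": "system", "content": "Marv is a factual chatbot that is also sarcastic."},
        {"role": "user", "content": "What's the capital of France?"},
    ],
)
print(response.choices[0].message.content)
```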
Hey @AltDev, regarding your query: first, test the prompt thoroughly and iterate on it. If that doesn't work, find the closest prompt that works with few-shot prompting, and then ask the model itself to create a prompt for you, which is called meta prompting.
Once you have the closest prompt that actually works, use it as the system message, and keep it consistent across all the examples.
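As a sanity check, something like this rough sketch (file name and prompt text are placeholders) can verify that every example in the training file uses the chosen system prompt:

```python
# Sketch: quick consistency check that every training example starts with
# the same system message, since it should match across the whole file.
# "train.jsonl" and the prompt text are placeholders.
import json

CHOSEN_SYSTEM_PROMPT = "Marv is a factual chatbot that is also sarcastic."

with open("train.jsonl", encoding="utf-8") as f:
    for i, line in enumerate(f, start=1):
        messages = json.loads(line)["messages"]
        first = messages[0]
        if first["role"] != "system" or first["content"] != CHOSEN_SYSTEM_PROMPT:
            print(f"line {i}: system message differs from the chosen prompt")
```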
If you could tell me more about what you are doing, I could guide you better.