Fine-tuning 3.5Turbo (1106) questions

whitneyland · January 17, 2024, 3:50pm

For fine-tuning (1106), do system instructions need to be in every Q/A pair? [1]
If not, why does the documentation show it that way with the example below? [3]
What is an “open ended training session”, where “just user or assistant filled in and the other is blank”? How does this work, is it documented anywhere? [2]

@Foxalabs said another thread:
[1] You of course don’t need a “system” prompt in your examples, it could just be the user and assistant roles,

[2] or even just user or assistant filled in and the other blank as part of an open ended training session.

Example from fine-tuning documentation:

[3] {“messages”: [{“role”: “system”, “content”: “You’re a factual chatbot.“}, {“role”: “user”, “content”: “Capital of France?”}, {“role”: “assistant”, “content”: "Paris it is.”}]}

{“messages”: [{“role”: “system”, “content”: “You’re a factual chatbot.“}, {“role”: “user”, “content”: “Capital of France?”}, {“role”: “assistant”, “content”: "Paris it is.”}]}

Foxalabs · January 17, 2024, 3:52pm

Hi,

Did you need anything clarifying?

whitneyland · January 17, 2024, 3:57pm

Yes that would be great.

Specifically I’m looking for answers to the three questions in my post. Partly to confirm your other post still stands, partly to understand more how it works.

Thanks for the fast reply!

Foxalabs · January 17, 2024, 4:12pm

Ok, so, you don’t need a system instruction. However, if you subsequently use a system instruction when using the fine tuned model, performance will be degraded. It is highly recommended, but there may be some situation (can’t think of any off hand) where that might be of use.

The documentation does not show it because most people will always include a system instruction, however this is a developer forum and so I tend not to sugar coat anything and instead give the actual basic requirements as that tends to be what developers want.

Open ended is where you do not have a Q in the Q/A part of the training, so, in this case you leave the user role blank and only supply the assistant role with text. This can be useful for application such as creating a fine tune of an authors works to create a model that creates text in the style of the author. It is a more advanced and less used aspect of fine-tuning

whitneyland · January 17, 2024, 5:12pm

Very helpful thank you. To confirm I’m hearing you right:

Using a system instruction with the fine-tuning data is not technically necessary but highly recommended.

On the other subject of providing only an answer instead of full Q/A, is there anywhere else I can read more about the applicability of this, like docs or a research paper? Or is it just kind of a heuristic / learned through experience?

jr.2509 · January 17, 2024, 5:24pm

Just a minor addition / reiteration here: the consistency in your training prompts and the prompts when you apply the model in practice is very critical. For instance, if you include a system prompt with specific instructions in your training, you can’t leave that out when you use the finetuned model later. The model will eiter return poor output or you might get an error message altogether.

I would also be interested in reading a bit more about the open ended training sessions. Sounds quite interesting.

Foxalabs · January 17, 2024, 5:47pm

I don’t have any papers on open-ended fine-tuning I’m afraid, everything I have done and learned has been in collaboration with members of this forum, it’s a rather new discipline and lacks a lot of the traditional “dusty tombs” for reference, we are basically the ones laying the groundwork.

I should add, I would not be surprised if there are papers on this topic, but I have not come across any in the wild, if anyone has, please feel free to link them.

Topic		Replies	Views
Do the system messages in GPT 3.5 Turbo fine-tuning need to be the same for all entries? API gpt-35-turbo , fine-tuning , fine-tuning-problems	8	2345	October 3, 2024
Can I fine-tune the model without the prompt and answer for the "system" role? API gpt-35-turbo , chatgpt , api	12	6348	January 29, 2024
Finetuning for shortening prompts Documentation fine-tuning	10	3817	December 24, 2023
Fine tuning - Am I missing something? API fine-tuning	2	232	August 22, 2024
System prompt on finetuning API	6	3233	November 6, 2024

Fine-tuning 3.5Turbo (1106) questions

Related topics