The idea of a `stop_sequence` in the training data for fine-tuning is to use the same token or sequence of tokens at the end of every response, so the model “learns” to:
- Include it in every response
- Stop talking after it does
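For example, here is a minimal sketch of preparing training data this way. The JSONL field names, file name, and choice of stop sequence are all assumptions for illustration; adjust them to whatever your fine-tuning pipeline expects.

```python
import json

STOP_SEQUENCE = "\v"  # hypothetical choice; alternatives are discussed below

# Toy training examples (placeholder data)
examples = [
    {"prompt": "What is 2 + 2?", "completion": "4"},
    {"prompt": "Name a primary color.", "completion": "Red"},
]

with open("train.jsonl", "w") as f:
    for ex in examples:
        # Append the same stop sequence to every completion so the
        # model learns to emit it, and to stop generating after it.
        ex["completion"] = ex["completion"] + STOP_SEQUENCE
        f.write(json.dumps(ex) + "\n")
```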
I wouldn’t recommend using the internal end-of-message token for this; instead, use a token or sequence of tokens which is incredibly unlikely to be naturally produced by the fine-tuned model for your use case.
Some interesting choices might be:
- `\a`: bell
- `\b`: backspace
- `\f`: form feed
- `\v`: vertical tab
- Etc.
I’m not entirely certain how each of these would work in practice[1]; it seems most people simply use some combination of some number of `#` and `\n`.
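At inference time, the same sequence would then be passed as the stop parameter so generation is cut off at (and excluding) it. A hedged sketch, assuming an OpenAI-style completions API (the model name is a placeholder; most completion-style APIs accept a similar `stop` argument):

```python
from openai import OpenAI

client = OpenAI()
response = client.completions.create(
    model="your-fine-tuned-model",  # placeholder model name
    prompt="What is 2 + 2?",
    stop=["\v"],  # the same stop sequence appended during training
)
# The returned text ends where the stop sequence would have appeared.
print(response.choices[0].text)
```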
This is on my list of things to experiment with ↩︎