I’ve tried to fine-tune GPT-3.5 1106 and 0125 on Azure with 30-40 example rows covering 4 different scenarios.
The result I got is a model that very often repeats its responses; for example, I get this output:
Perfect! Can I ask you if you would like to order for home delivery, takeaway or book a table? Perfect! Can I ask you if you would like to order for home delivery, takeaway or book a table?
Instead of simply giving it back once.
I know that the number of examples is really low, and I’m already planning to remove similar names and enhance the dataset, but I was not expecting this behavior to come out of a fine-tune.
Ok, interesting. You already included a higher frequency penalty. You might also want to try adding a presence penalty, perhaps starting with a value of 0.5.
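For reference, a minimal sketch of how both penalties could sit alongside the usual chat completion parameters (the deployment name and penalty values here are placeholders, not something from this thread):

```python
# Sketch: request parameters for a fine-tuned chat model.
# "my-finetuned-deployment" is a placeholder deployment/model name.
request = {
    "model": "my-finetuned-deployment",
    "messages": [
        {"role": "system", "content": "You are a restaurant assistant."},
        {"role": "user", "content": "Hi, I'd like to order."},
    ],
    "frequency_penalty": 0.7,  # penalize tokens proportionally to how often they appeared
    "presence_penalty": 0.5,   # flat penalty for any token that has already appeared
}
```

The difference between the two: frequency penalty grows with each repetition of a token, while presence penalty applies once as soon as a token has appeared at all.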
This has to do with stop sequences. A stop sequence is a token or set of tokens that you may have appended to the end of each assistant sample in your training data.
Stop sequences are used during fine-tuning by appending them to the end of each expected assistant response.
When you consume the fine-tuned model, the same stop sequence is passed via the stop parameter.
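As an illustration (the `@@@` sentinel here is a hypothetical choice, not something prescribed by the API), one JSONL training sample and the matching inference parameter might look like:

```python
import json

STOP = "@@@"  # hypothetical sentinel; any string unlikely to occur in real output works

# One line of the fine-tuning JSONL, with the sentinel appended to the assistant turn.
sample = {
    "messages": [
        {"role": "user", "content": "Can I book a table?"},
        {"role": "assistant", "content": "Of course! For how many people?" + STOP},
    ]
}
line = json.dumps(sample)

# At inference time, the same sentinel is passed via the stop parameter,
# so generation halts as soon as the model emits it.
inference_params = {"stop": [STOP]}
```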
You can also derive a stop sequence if all your assistant messages already end with a common token; ideally, this token wouldn’t appear anywhere in the generated text except at the very end.
That shouldn’t be required when tuning the chat completions model, though. The stop token is already built into the format and should be trained, and trained with lots of repetitions of it.
However, especially when doing a fine-tune that includes functions as examples, it seems damaged. Just one more thing that OpenAI has left unaddressed. It is especially broken in that the AI doesn’t continue into producing a “user” token; it repeats its own output.
The normal stop tokens are 100265 and 100260, depending on what the AI is emitting.
You’ll need to include a bunch of normal conversation outside of functions, and then end with your own stop token sequence like @!@!@!@!@ in training. Then you can stop at @!@ with the API call.
Yes, I’ve seen that everything around function-call tuning seems a little messed up, starting with the naming convention (tool calls use tool_call_id while fine-tuning uses function_name).
That would be the idea. I use @! because those characters can’t be joined with each other into one token. However, upon reflection, the AI might get confused about which one to produce first if trained on a long run of them, so maybe just one instance of the sequence. You can also use something like ########, uniformly and always the same length, which is in fact a single token.
I’ll try to add ######## at the end of each assistant message and retrain. Then I’ll also need to pass ######## as the stop sequence at inference time, right?
Yes, and you shouldn’t need to exactly match the token as a stop sequence, so simply #### will catch many alternate tokens if the AI tries to write it differently or shorter. This does prohibit extended runs of pound signs in the AI output, but the only place that is likely is when repeating back code.
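Putting the two steps together, a small sketch of the retraining prep plus the inference-side stop (file reading/writing omitted; the helper operates on JSONL text, and the chat format shown is an assumption about your dataset layout):

```python
import json

SENTINEL = "########"  # a single uniform token, per the suggestion above

def append_sentinel(jsonl_text: str) -> str:
    """Append the sentinel to every assistant message in chat-format JSONL text."""
    out_lines = []
    for line in jsonl_text.splitlines():
        sample = json.loads(line)
        for msg in sample["messages"]:
            if msg["role"] == "assistant":
                msg["content"] = msg["content"] + SENTINEL
        out_lines.append(json.dumps(sample))
    return "\n".join(out_lines)

# At inference, "####" suffices as the stop sequence: it also catches shorter
# or differently tokenized runs of pound signs the model might emit.
inference_params = {"stop": ["####"]}
```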
Thank you so much, I’ll report back on how the process goes.
UPDATE:
It seems to work quite a bit better. There is still some repetition, but I think that is because of the small dataset the model is fine-tuned on. I’ll try to enhance it over the next few days.
Thanks for the support