How to know which stop sequence was detected in a completion?

keithaw · May 15, 2023, 7:45pm

I’m using the Completions API and have multiple stop sequences that are passed into the request. If the stop sequences are stripped from the completion response, is there a way to know which stop sequence was “triggered”?

For example, suppose that I send a completion request like this:

response = openai.Completion.create(
  model='davinci',
  prompt='What is your favorite 3-letter word?',
 `stop=['FOO', 'BAR']`,
)

Let’s say the completion (without the stop sequence) is something like this:

My favorite 3-letter word is FOO because I like the Foo Fighters. I saw them in concert last summer... <continue until max_tokens>

But another possible completion might be:

My favorite 3-letter word is BAR because I like to drink and socialize... <continue until max_tokens>

Because “FOO” and “BAR” are both stop sequences, the model completion would stop generating tokens once it hits either one, so it would look like this:

My favorite 3-letter word is

On the client side, I would not know if “FOO” or “BAR” was generated since it is stripped from the completion that is returned in the response.

Is there any way to determine which stop sequence was triggered? Or is there maybe an option to include the stop sequence in the completion?

jesskreck · May 22, 2023, 11:36am

Are you using it on the Playground or did you implement the API in your own project? If you’re sending custom prompts to the API you can print the finish_reason of the response into your app. I’ll copy the response object from OpenAI’s API reference. Finish_reason is inside the choices array:

{
“id”: “chatcmpl-123”,
“object”: “chat.completion”,
“created”: 1677652288,
“choices”: [{
“index”: 0,
“message”: {
“role”: “assistant”,
“content”: “\n\nHello there, how may I assist you today?”,
},
“finish_reason”: “stop”
}],
“usage”: {
“prompt_tokens”: 9,
“completion_tokens”: 12,
“total_tokens”: 21
}
}

keithaw · May 25, 2023, 3:03pm

Thanks - yes, I’m implementing the API using the Python SDK. I see the finish_reason in the response, but it only tells you the whether the completion was terminated to to a stop sequence (i.e. "stop" or because it hit the token limit ("length").

In my case, I have more than one stop sequence and want to know which stop sequence was triggered when the finish_reason is "stop".

DavidOS366 · May 25, 2023, 3:52pm

Just for trial what happens if you send text like this below.
My favorite 3-letter word is BARBAR because I like to drink and socialize

jesskreck · May 26, 2023, 12:06pm

Ok, now i get it. So finish_reason is just saying that something out of the stop array which was send in the initial request was used to stop the creation/completion. I guess they modified the API so that it’s sending back “stop” as a string instead of an object. The object would probably contain the information you need. Did you try to manually change it? Otherwise you’re right, it’s not part of the documentation but it would actually be very useful if the response would send back that information

DavidOS366 · May 29, 2023, 1:59am

I am guessing that it’s also case sensitive. So BAR and bar are two different stop sequences.

keithaw · May 31, 2023, 3:41pm

@DavidOS366 yes the stop sequences are just exact string matches, so they’re case sensitive, and also your example of “BARBAR” would simply stop at the first “BAR”.

The real-world example of why I’m trying to do this is that I’ve fine-tuned a conversation model to use a format where the AI’s response either ends in a user prompt (e.g. “USER:”) if the conversation should continue or an end-of-conversation marker "e.g. “–END OF TRANSCRIPT–” if it’s done. If I included both as stop sequences, then the completions stop correctly but since they’re stripped out, I can’t tell which one was triggered.

So for now, I’m only including one of stop sequences in the API call and then relying on a hacky regex parser to look for the other sequence(s). It works, but is not ideal and would be much more efficient if the API could just tell me which sequence it triggered.

DavidOS366 · May 31, 2023, 3:52pm

You could try for B.ARBAR as stop sequence. So in your output you might get B.AR at the end, since it will stop at BAR and not B.AR

david.castillo · September 26, 2023, 11:46am

~~Hi. I was facing the same issue and I found out that if you set echo=True the API will send you back your prompt plus the response stop sequence included.~~

[edit] I believed this, but it was actually not true. I don’t see the reason why to exclude the stop sequence from the response. On the contrary: having it would be of great benefit. I suggest to include it as an additional element in the choice object (e.g. stop_sequence = null or str) to keep compatibility with current specification.

Topic		Replies	Views
Is it possible to stop a completion at the Nth occurrence of the stop sequence? API	9	1673	December 18, 2023
Feature request: option to prevent a completion from sampling the default stop token API	7	1368	December 14, 2023
Stop sequences being ignored API	5	2470	February 24, 2023
Difference in "tokens" and "text" in response Prompting	2	733	December 16, 2023
Fine tuning Completions are cut API	3	555	April 13, 2022

How to know which stop sequence was detected in a completion?

Related topics