Feature request: option to prevent a completion from sampling the default stop token

While generating a response, the API’s models assign some probability to producing the stop token at each step. Sometimes the caller knows that stopping is the wrong behavior, and that the model should keep producing output until it hits max_tokens (or until one of the caller-specified stop sequences is encountered).

In theory the caller can simulate this behavior by calling the API repeatedly (if they have no custom stop sequences), but that is much more expensive. It would be far simpler to ask the sampling code to reject the stop token and resample [unless the stop token has reached probability 1.0, which is an unlikely corner case that may deserve a dedicated error].

The side point about custom stop sequences is a subtle corner case and perhaps not that important. If you don’t provide any stop sequences, you can get the desired result by repeatedly calling back to Completions.create with the previous generation appended to your prompt, until you hit max_tokens. But if you provide any custom stop sequences, you can’t tell the difference between stopping due to one of those (which you may want), and stopping because of the default stop token (which you want to override and keep generating).
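
A minimal sketch of that loop, assuming the legacy (pre-1.0) Python client and a placeholder model name; the expense comes from re-sending the ever-growing prompt on every round trip:

```python
import openai

# Workaround sketch, valid only when no custom stop sequences are set:
# call the completions endpoint repeatedly, appending each generation to
# the prompt, until the output token budget is spent.
def generate_ignoring_stop(prompt: str, max_total_tokens: int = 256,
                           chunk_tokens: int = 64) -> str:
    generated = ""
    used = 0
    while used < max_total_tokens:
        resp = openai.Completion.create(
            model="text-davinci-003",  # placeholder model
            prompt=prompt + generated,
            max_tokens=min(chunk_tokens, max_total_tokens - used),
        )
        text = resp["choices"][0]["text"]
        if not text:  # the model stopped immediately; avoid looping forever
            break
        generated += text
        used += resp["usage"]["completion_tokens"]
    return generated
```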

I think you can do this through the logit_bias parameter in the OpenAI API.

See the last sentence of the doc for that parameter; the default stop is actually a single (special) token.
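
If I’m reading the same sentence, the recipe is to ban that token via logit_bias. A sketch, again assuming the legacy (pre-1.0) Python client (model and prompt are placeholders):

```python
import openai

# A bias of -100 effectively bans a token from sampling; 50256 is the
# id of <|endoftext|> in the GPT-3 tokenizer, so this keeps the model
# generating until max_tokens or a caller-supplied stop sequence.
resp = openai.Completion.create(
    model="text-davinci-003",  # placeholder model
    prompt="Once upon a time",
    max_tokens=128,
    logit_bias={"50256": -100},
)
print(resp["choices"][0]["text"])
```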

Yep, that docstring also links to a tokenizer tool that lets you see the token ids of arbitrary text.

Yeah, that one’s a special token. A model could generate the literal characters <|endoftext|>, but that wouldn’t stop the sequence, because they aren’t the special token.
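
You can check the distinction locally, e.g. with the tiktoken library (my choice here, not the tool linked above, but it uses the same vocabulary):

```python
import tiktoken

enc = tiktoken.get_encoding("gpt2")  # the GPT-2/GPT-3 vocabulary

# The literal characters "<|endoftext|>" encode to several ordinary tokens...
print(enc.encode("<|endoftext|>", disallowed_special=()))
# ...none of which is the single special end-of-text token:
print(enc.eot_token)  # 50256
```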

Interesting, I think it’s catching a case where the model is trying to spit out a private API key.
