I am trying to ban the word ‘assume’, in my generations. I am trying to use logit bias parameter to accomplish this but the word ‘assume’ as per the OpenAI tokenizer site takes up two tokens. So do I ban both of these tokens?
Hi there! Due to the way tokens work, it’s probably best to suppress “ assume”, including the preceding space (token ID 7048 in that tokenizer). This won’t reliably remove all mentions of assume, since it ignores “assume” without a preceding space and “Assume” (both with and without a space), but I suspect it will help a fair bit.
The site you linked shows the 50k tokenizer for the GPT-3 models, but the models you’d be using through the Chat Completions endpoint are gpt-3.5-turbo or GPT-4. Those use a 100k-token dictionary (cl100k_base) with different token IDs:
" assume": 9855
“assume: 46151
" Assume”: 63297
“Assume”: 5733 + 3972 (and other variants are also compound tokens)
“presume”: 24544 + 3972 (showing that you will stifle language if you block a fragment)
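To put those IDs to use, here’s a minimal sketch of building the `logit_bias` dictionary. It only suppresses the whole-word tokens listed above (not the shared fragment 3972, for the reason just shown). The IDs are the ones quoted in this thread; verify them for your model with the tiktoken library before relying on them.

```python
# Token IDs for "assume" variants in cl100k_base, as quoted in this
# thread -- re-check with tiktoken for your exact model (assumption).
ASSUME_TOKEN_IDS = [9855, 46151, 63297]  # " assume", "assume", " Assume"

def build_logit_bias(token_ids, bias=-100):
    # A bias of -100 effectively bans a token in the Chat Completions
    # API; keys must be token IDs as strings.
    return {str(t): bias for t in token_ids}

logit_bias = build_logit_bias(ASSUME_TOKEN_IDS)
# Pass it in the request, e.g.:
#   client.chat.completions.create(model="gpt-3.5-turbo",
#                                  messages=..., logit_bias=logit_bias)
```

Note that compound variants like "Assume" (5733 + 3972) can still slip through, since banning 3972 alone would also break words like "presume".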