Logit Bias default bias and blocking tokens not in list

ubersuperrareboss · December 16, 2024, 1:59am

How do I set a default logit bias value. Say I want to create generations where only certain tokens are allowed and to block all other tokens from showing up, how would I achieve this? I could possibly get all tokens and unless they are in my list, set their bias to -100 however I’m not sure how to get a set of all the tokens in the vocabulary and I’m also wondering if there’s a better way.

_j · December 16, 2024, 2:17am

You cannot demote every of 200000 logits.

Invalid 'logit_bias': too many properties. Expected an object with at most 1024 properties, but got an object with 10000 properties instead.

Instead, you would have to highly promote only those you want.

Additionally, this will prevent the AI from ending the output, a token you cannot control. You would have to use a very short max_response_tokens parameter.
Otherwise, provide a stop sequence and a token of finishing that you still can’t trust.

Then finally, validate across the input cases to see if the reproduction and determinations are still as you expect. If you only want the words “good” “neutral” “bad”, you have to ensure they still have the original qualities and ratios, which you can examine by logprobs.

ubersuperrareboss · December 16, 2024, 3:47am

If I wanted “good”, “neutral”, “bad” I would probably do some prompt engineering paired with a structured output. What I want is more like only allowing “uncomplicated” words, actually my task is for language learning. I want to only allow tokens the user has an understanding of, which would be in the range of a few hundred at the minimum. And I need it to be more sophisticated, for example I wouldn’t fully block tokens, but simply demote them so that if the model strongly predicts that token it would have no choice but to select it even if the user doesn’t know what it means. Also I would of course not include special characters in the demotion, so the stop token wouldn’t be an issue.

sps · December 16, 2024, 4:20am

Welcome @ubersuperrareboss

Logits are not the best way to achieve what you have described, in my opinion.

Here are two much more convenient and effective methods:

Pass the model the vocabulary to use within the system message along with an appropriate instruction.
You can also use grade level or age level descriptions within the system message. e.g., Please make sure that your vocabulary is lucid for a 2nd-grade student.

A good prompt would be able to influence the logprobs to favor your expected output.

Topic		Replies	Views
Does logit_bias remove tokens from top_logprobs? API	5	555	April 2, 2024
How do I use logit bias to ban phrases and not just individual tokens? API	2	2052	June 16, 2023
Logit bias does not work on gpt-3.5-turbo-instruct API gpt-35-turbo-instruc , logit-bias	3	887	December 21, 2023
Logit bias for words? Prompting	6	1729	December 21, 2023
Is it possible to register multiple words for the logit_bias parameter? API gpt-35-turbo	4	1143	October 10, 2023

Logit Bias default bias and blocking tokens not in list

Related topics