logit_bias no longer fully working

It used to be that a logit_bias of -100 would completely prevent the token from showing up, but now it seems to be more of a suggestion.

An example test prompt is:

{
  "model": "gpt-3.5-turbo",
  "messages": [
        {"role": "system", "content": "Follow the pattern and use brackets [] in your response."},
        {"role": "user", "content": "1: [test1]\n\n2: [test2]\n\n"}
    ],
  "max_tokens": 160,
  "temperature": 1.2,
  "stream": false,
  "logit_bias": {"510":-100}
}

It does bias the output, but it will still output the " [" token about 40% of the time.
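
For anyone who wants to reproduce this, here’s a rough Python sketch of the same request using the openai Python client (illustrative only: it assumes OPENAI_API_KEY is set in the environment, and the loop count is arbitrary):

from openai import OpenAI

# Minimal reproduction sketch: same request as the JSON above, repeated a few
# times to see how often the " [" token still shows up despite the -100 bias.
client = OpenAI()

for _ in range(10):
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[
            {"role": "system", "content": "Follow the pattern and use brackets [] in your response."},
            {"role": "user", "content": "1: [test1]\n\n2: [test2]\n\n"},
        ],
        max_tokens=160,
        temperature=1.2,
        logit_bias={"510": -100},  # "510" is the " [" token being suppressed
    )
    text = response.choices[0].message.content or ""
    print(repr(text), "| contains ' [':", " [" in text)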

Use case: this is problematic for story-telling text generation where the prompt includes a lot of instructions or information enclosed in brackets. We’re supposed to generate only prose, but ChatGPT sometimes decides to copy the style of the prompt and include bracketed text, even though every bracket token (both with and without the preceding space) has been set to a logit_bias of -100.

I haven’t looked into this yet, but have you tried something like -999 or -1000 instead of -100? Just wondering if there was a change in magnitude.

A square bracket is often not a single token. It appears inside hundreds of different tokens, such as ( [") (the characters within the parentheses).

You can put the exact generated output into a token encoder and check whether the character sequence containing the bracket is yet another unanticipated token, but you just end up swatting flies.
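
If you want to check that programmatically, here’s a rough tiktoken sketch (assuming the gpt-3.5-turbo encoding; the sample string is just an example) that counts how many vocabulary entries contain a bracket and shows which token IDs a given output actually used:

import tiktoken

# Count vocabulary entries that contain "[" and inspect a sample output.
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

bracket_tokens = []
for token_id in range(enc.n_vocab):
    try:
        piece = enc.decode_single_token_bytes(token_id)
    except KeyError:
        continue  # some IDs in the range are unused or special
    if b"[" in piece:
        bracket_tokens.append(token_id)

print(f"{len(bracket_tokens)} tokens contain '['")

# Run an actual model output through the encoder to see which bracket token it used.
sample = "3: [test3]"
print([(t, enc.decode([t])) for t in enc.encode(sample)])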

Yes; the API doesn’t allow a bias lower than -100.

Not in this case; that’s disproven by the simple example I provided, where the bracket is exactly the " [" token (if the model answers in the expected format).

You may be working in the realm of extreme certainty. Here’s an example I constructed for gpt-3.5-turbo-instruct:

[image]

How about, then, if something like ±0.01% is thrown into the logits?

It might be either optimizations that reduce the possible logits, or randomness in the outputs that is unobservable (until OpenAI frees up the logprobs).

It has already been observed that these new turbo models produce random top-1 token choice flips even when every attempt at determinism is made. Why not a random normalized probability over 1.00000?

Interesting, thanks for taking a look. I had this repro with longer text where a " [" shouldn’t have anywhere near 100% certainty. But I’m not sure I’ll be able to get a reproducible example of that, since it might vary randomly with temperature.

Also, I think the fact that biasing to -100 does influence the output in my “test3” example illustrates that it’s still a fixable bug (it reduces the chance of generating the correct answer from 100% to around 40-50%).

I’ve moved this to the bugs category, as it does seem like this should have a much stronger influence than it does.


My thoughts precisely. Just because a substring appears in the generated text doesn’t mean it’s the same token.

@AI-Roguelite can you share the exact generated text?

Here’s an example where similar-looking pieces of text have different token IDs:

text: [image]
tokens: [image]
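
If you want to reproduce the comparison locally, a quick tiktoken check (the strings here are just illustrative) shows how the surrounding characters change the token IDs:

import tiktoken

# Similar-looking brackets tokenize differently depending on what precedes them.
enc = tiktoken.encoding_for_model("gpt-3.5-turbo")

for s in ["[", " [", "\n[", '["', "[test"]:
    print(repr(s), "->", enc.encode(s))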

I had the same problem when trying to remove double quote marks (") from the response.

As @sps suggests, not only is " a token, but so are "a, "b, "c and so on.
I see it’s the same case with brackets:
[image]

Finding and removing these with logit_bias doesn’t seem like the right solution here. In my case I will be running the response through a post-processing script instead.
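
Something along these lines, for example (a hypothetical clean_response helper; the regexes are only a starting point, not a complete cleanup):

import re

def clean_response(text: str) -> str:
    """Strip bracketed segments and stray double quotes from a model reply."""
    text = re.sub(r"\[[^\]]*\]", "", text)    # drop anything inside [ ... ]
    text = text.replace('"', "")              # drop double quote marks
    return re.sub(r"  +", " ", text).strip()  # tidy up leftover spacing

print(clean_response('She opened the door. ["style note"] "Hello," she said.'))
# -> She opened the door. Hello, she said.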


There are different token IDs because the second [ includes the line break.

You may find a package like instructor useful for constraining the model’s output to a specific data type. It uses pydantic under the hood.
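
For what it’s worth, a rough sketch of that approach looks something like this (the exact instructor entry point differs between versions, so treat it as illustrative and check the instructor docs):

import instructor
from openai import OpenAI
from pydantic import BaseModel

# Hypothetical response model: we only want plain prose back.
class Prose(BaseModel):
    text: str

# instructor wraps the OpenAI client so responses are parsed into the model.
client = instructor.from_openai(OpenAI())

result = client.chat.completions.create(
    model="gpt-3.5-turbo",
    response_model=Prose,
    messages=[
        {"role": "system", "content": "Write plain prose. Do not use square brackets."},
        {"role": "user", "content": "Continue the story."},
    ],
)
print(result.text)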
