Surprising logprobs outputs for first token if it's '0'

Hi, I’m making calls to the chat completions API to get gpt-3.5-turbo-0125 to classify a prompt as ‘0’ or ‘1’. If the first token in the response is ‘0’, then the linear probability always equals 1 + logprob for that token. But if the first token is anything else, or if ‘0’ is not the first token, we don’t get the same behaviour. Does anyone know why that’s the case? It’s not a problem per se, it’s just that when patterns show up in your statistics, you wonder why.
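For reference, here is a minimal sketch of the kind of call being described, using the official `openai` Python client. The prompt content and response handling are illustrative assumptions, and the API call itself is commented out so the conversion at the bottom runs standalone:

```python
import math

# from openai import OpenAI
# client = OpenAI()
# resp = client.chat.completions.create(
#     model="gpt-3.5-turbo-0125",
#     messages=[{"role": "user", "content": "Classify ... Answer only 0 or 1."}],
#     logprobs=True,
#     max_tokens=1,
# )
# first = resp.choices[0].logprobs.content[0]
# token, logprob = first.token, first.logprob

# Illustrative value of the kind the API returns for a near-certain '0':
token, logprob = "0", -4.3e-05

linear_prob = math.exp(logprob)  # convert logprob -> probability
print(token, logprob, linear_prob, 1 + logprob)
```

At this level of certainty, `linear_prob` and `1 + logprob` print as the same number, which is exactly the pattern being asked about.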

A probability is a value between 0 and 1, representing a statistical chance of 0%–100%.

Because all ~100k tokens in the BPE vocabulary are evaluated for their certainty, the probabilities of tokens in the tail of alternates quickly become extremely small. When you tell the AI to produce only a 0 or 1, the chance it produces the Chinese character for “book” is quite remote.

It becomes unwieldy to read small probabilities when they start with 20 fractional zeroes. Hence logprobs: the natural logarithm of the probability, which gives us the exponent in the formula \text{prob} = e^{\text{logprob}}

Instead of dealing with a number like 0.00000000000004, the logprob ln(tiny) is -30.85.
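You can verify that round trip in a couple of lines (the tiny value is the one from the sentence above):

```python
import math

tiny = 4e-14                 # i.e. 0.00000000000004
logprob = math.log(tiny)     # natural log gives the exponent
print(logprob)               # ≈ -30.85
print(math.exp(logprob))     # back to roughly 4e-14
```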

This also gives temperature control a reasonable range when directly acting on logarithmic logits.

Now what happens in your depiction when the probability approaches 100%? The logprob approaches zero from below, and since x^0 = 1 for any base, e^{\text{logprob}} approaches 1. Euler’s number hardly matters.

So really, you are seeing the first-order Taylor approximation e^x \approx 1 + x, which is extremely accurate when x (the logprob) is close to 0: the next term, x^2/2, is too small to show up at the displayed precision. That’s why the linear probability reads as exactly 1 + logprob for a near-certain ‘0’, and why the pattern breaks down for tokens with larger (more negative) logprobs. It’s an artifact of the math, not of the model.
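A quick check of how good that approximation is, and how it degrades as the logprob moves away from 0:

```python
import math

# Compare exp(x) with its first-order Taylor approximation 1 + x
# for logprobs of decreasing magnitude.
for logprob in (-0.1, -0.01, -0.001, -1e-05):
    prob = math.exp(logprob)
    approx = 1 + logprob
    print(f"{logprob:>8}: exp={prob:.8f}  1+logprob={approx:.8f}  diff={prob - approx:.2e}")
```

The difference shrinks quadratically, so for a token the model is all but certain of, `exp(logprob)` and `1 + logprob` are indistinguishable at any reasonable display precision.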

Another cute math trick that exists for no reason is that for small angles in radians, sin(angle) ≈ angle.
sin⁻¹(0.08) = 0.08008558, which looks like boobs.


You might find it more instructive to get the top-10 logprobs of the token position instead of just the one chosen.