Why does my completion sometimes return the token with the higher log-loss of the two?

I’ve got a fine-tuned model that is trained to output one of two classes (“yes” or “no”). In a small minority of cases, the “text” field returned by the completion contains one class, e.g. “yes”, but when I look at the “top_logprobs”, that class has a significantly higher log-loss (so a lower probability) than the other.

For instance, there is a row where the text is “yes”, but the first element of top_logprobs is {“no”: -0.0232, “yes”: -3.7727}. I’m wondering whether this has something to do with how the probability is calculated, e.g. torch.sigmoid(logprob) vs. np.e**logprob (which is what I’m doing right now), but I’m not sure.
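For reference, here’s roughly what I’m doing now to turn the returned logprobs into probabilities (a minimal sketch on the example row above; the dict is just how I’ve pulled the values out of the response, not the raw API object):

```python
import numpy as np

# The top_logprobs entry from the example row above.
top_logprobs = {"no": -0.0232, "yes": -3.7727}

# My current conversion: probability = e**logprob.
probs = {tok: np.exp(lp) for tok, lp in top_logprobs.items()}
# -> {'no': ~0.977, 'yes': ~0.023}, so "no" should win by a wide margin,
# yet the completion's "text" field came back as "yes".
```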

I’m thinking the issue might be that GPT is applying a sigmoid to the logprob values and then checking whether they’re over 0.5 (rather than using e^logprob or a softmax and taking the max probability). The cases where it goes wrong would be ones where even the token whose sigmoid gives the larger probability is still below 0.5 (a quick check of this on the example row is below). I can’t find anything in the reference about making completions use e^logprob or softmax, so I’d appreciate any help there.
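Here’s the quick check I mean; this is just a sketch of my guess at the rule, not anything taken from the docs:

```python
import numpy as np

top_logprobs = {"no": -0.0232, "yes": -3.7727}

# Hypothesized rule: sigmoid each logprob (as torch.sigmoid would) and
# check whether it clears 0.5, instead of taking the exp/softmax argmax.
over_half = {tok: 1 / (1 + np.exp(-lp)) > 0.5 for tok, lp in top_logprobs.items()}
# -> {'no': False, 'yes': False}: "no" sigmoids to ~0.494 and "yes" to ~0.022,
# so neither token clears 0.5, even though "no" is the clear winner under e**logprob.
```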