How do I prompt the API to return me ONLY the probability from 0 a 100 of something?

stefano.ceravolo · March 31, 2023, 6:32pm

Example, I am thinking of a solution where I ask the API “, what do you think is the probability that the customer wants to buy the product in the chat. Please return only a number from 0 (least likely) to 100 (most likely)”.

However, the system returns often a larger text…any suggestions?

N2U · March 31, 2023, 7:02pm

You can reduce the number of max tokens, this will force the model to end after the set amount of tokens. Keep in mind that GPT is a language model that generates text, it’s not made to perform specific calculations or provide exact probabilities. It is however really good at pretending to do these tasks.

A better approach could be asking a series of binary questions like:

“Has the customer shown interest in the product?”
“Have they asked any questions about the product?”
“Have they mentioned any specific features they are looking for?”
“Have they mentioned any concerns or objections to purchasing the product?”

You can then use regular math to determine a probability score based on the number of “yes” responses.

curt.kennedy · March 31, 2023, 7:15pm

If you fine tune a base model on buy/not buy, you can use log_probs to give you a number between 0 and 1, and you multiply this by 100 to get your probability.

Admittedly, this is more advanced and would use the API, but totally doable.

N2U · March 31, 2023, 9:04pm

That’s a great idea, I didn’t even think about that, you won’t have as much knowledge about how said probability was calculated, but I don’t think it matters in this case.

curt.kennedy · March 31, 2023, 9:38pm

Yeah, it works!

Some working details you need to know … you need to convert the token_logprobs to a linear scale before converting it to a percent.

Example: I have a categorizer from Babbage that is trained to output ’ 0’ or ’ 1’. In this example, say ’ 1’ means the lead is going to buy, and ’ 0’ means the lead won’t buy.

"choices": [{"text": " 1", "index": 0, "logprobs": {"tokens": [" 1"], "token_logprobs": [-3.170517e-06], "top_logprobs": [{" One": -13.691312, " 1": -3.170517e-06}], "text_offset": [356]}, "finish_reason": "length"}], "usage": {"prompt_tokens": 124, "completion_tokens": 1, "total_tokens": 125}}'

Then define the base of the natural logarithm:

e = 2.718281828459

Then take the log probs out of the JSON response and convert to linear:

# get the log probability (base e)
LogProb = FullReaction["choices"][0]["logprobs"]["token_logprobs"][0]
print(f"LogProb: {LogProb}")
                    
# convert the log probability to a probability in [0,1]
Prob = e**LogProb

Then multiply the Prob value by 100 to get your percent from the chosen value. So if the value in linear from the chosen value is 0.99 for ' 0', you set the percent to 1-0.99 or 1%. If it is 0.99 for ’ 1’, you set it to 99%. Slight details, but that is all you need.

To get the log_probs info to show up in the response, I believe you just call it out, and since I am making a binary decision on 1-token, here is the core JSON settings to send over:

{"temperature": 0, "max_tokens": 1, "top_p": 1, "logprobs": 2, "frequency_penalty": 0, "presence_penalty": 0}

Not sure why request 2 of them, but I only use the single one from token_logprobs.

Details matter, but this is what you need to do to get it to work.

So in my example, I get

LogProb = -3.170517e-06 for token ' 1' = BUY!
Prob = e**LogProb
print(Prob)
# 0.999996829488026

SO the lead has a 99.9996829488026 percent chance of buying. WOOT!

Topic		Replies	Views
How do I design an effective question? API	13	636	February 25, 2024
Best way to make GPT estimate probabilities? Prompting prompt-engineering	9	1179	September 21, 2024
Logprobs for specific tokens, not just top tokens API api	5	252	January 24, 2025
Logprobs and message.content are inconsistent API gpt-4 , api , logprobs	6	1598	April 11, 2024
Logprob value is unbounded, i used sigmoid (converting API logarithm to probabilities) API	5	1632	November 12, 2023

How do I prompt the API to return me ONLY the probability from 0 a 100 of something?

Related topics