Function calling temperature

We have automated tests that check whether we implemented our function calling properly. For general context, the tests look something like this:

await testToolPrompt(['Hours on job 12345'], {
  name: 'my_tool',
  arguments: {
    jobId: 12345,
    isHoursRequested: true,
  },
});

The test calls the completion function hundreds of times to make sure the AI returns the proper arguments (the second argument is what we expect). From the success/failure stats we can tell whether a function definition is off and requires additional changes.
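For the curious, here is a minimal sketch of what such a harness could look like, assuming the OpenAI Node SDK and the `tools` API; `testToolPrompt` itself, the model name, and the naive JSON-based argument comparison are all our own invention, and the tool schema is reduced to the two fields above:

import OpenAI from 'openai';

const client = new OpenAI();

// Hypothetical harness: fires the same prompt `runs` times and returns the
// fraction of responses whose tool call exactly matches the expected one.
async function testToolPrompt(
  prompts: string[],
  expected: { name: string; arguments: Record<string, unknown> },
  runs = 100,
  temperature = 1,
): Promise<number> {
  let pass = 0;
  for (let i = 0; i < runs; i++) {
    const res = await client.chat.completions.create({
      model: 'gpt-4o-mini', // assumption: any tool-capable model works here
      temperature,
      messages: prompts.map((content) => ({ role: 'user' as const, content })),
      tools: [{
        type: 'function',
        function: {
          name: 'my_tool',
          parameters: {
            type: 'object',
            properties: {
              jobId: { type: 'integer' },
              isHoursRequested: { type: 'boolean' },
            },
            required: ['jobId'],
          },
        },
      }],
    });
    const call = res.choices[0].message.tool_calls?.[0];
    // Naive deep-equal via a JSON round-trip; fine for flat argument
    // objects as long as key order matches.
    if (
      call?.type === 'function' &&
      call.function.name === expected.name &&
      JSON.stringify(JSON.parse(call.function.arguments)) ===
        JSON.stringify(expected.arguments)
    ) {
      pass++;
    }
  }
  return pass / runs;
}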

I’ve played with temperature and, surprisingly to me, it has a significant effect on the quality of function-calling responses. I naively assumed function calling isn’t just text generation and should be precise at any given temperature. At temperature 0 we get all the arguments and values we expect, but at temperature 2 it generates random arguments that make no sense at all. With temperatures between 0 and 2 we get middling, but still low-quality, results.
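Measuring this comes down to sweeping the temperature with the same harness (reusing the hypothetical `testToolPrompt` sketch above):

// Rough sweep over temperatures, printing the exact-match rate for each.
for (const t of [0, 0.5, 1, 1.5, 2]) {
  const rate = await testToolPrompt(
    ['Hours on job 12345'],
    { name: 'my_tool', arguments: { jobId: 12345, isHoursRequested: true } },
    100,
    t,
  );
  console.log(`temperature ${t}: ${(rate * 100).toFixed(0)}% exact matches`);
}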

We want to keep the ability to set temperature for general text generation, but we also expect our functions to receive precise arguments. Is there a workaround that lets us keep using temperature without it messing up our function calling? Thank you!

The AI generates tokens, whether it’s the text of squirrel poems or the invocation of function parameters, and those tokens are still sampled from temperature-scaled logits, which allows less-likely token options to be chosen unless temperature is cranked down to something like 0.01 (a 100x spread of the logprobs).
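To put numbers on that, here is a toy softmax-over-scaled-logits calculation (the logits are made up, not real model output); note how a distractor token that is ~100x less likely at temperature 1 grabs real probability mass at temperature 2:

// Softmax over temperature-scaled logits: p_i = exp(l_i / T) / sum_j exp(l_j / T)
function softmax(logits: number[], temperature: number): number[] {
  const scaled = logits.map((l) => l / temperature);
  const max = Math.max(...scaled); // subtract the max for numerical stability
  const exps = scaled.map((s) => Math.exp(s - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

// Two candidate tokens with a 4.6-logit gap (~100x likelihood ratio at T=1).
const logits = [0, -4.6];
for (const t of [0.01, 1, 2]) {
  console.log(`T=${t}:`, softmax(logits, t).map((p) => p.toFixed(4)).join(' vs '));
}
// T=0.01 -> 1.0000 vs 0.0000  (the likelier token takes all the mass)
// T=1    -> 0.9900 vs 0.0100
// T=2    -> 0.9089 vs 0.0911  (the distractor is now ~9x more likely)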

I’ve started (and, for my own use, finished) exactly such an idea. It would be improved if it understood the information domain of the kinds of requests that are meant to invoke functions.

One might resubmit at low temperature if a function was invoked - but note that the very choice of calling a function instead of producing content text is itself decided by the probability of the first output token.
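As a sketch of that resubmit idea (same hypothetical client as above; `completeWithPreciseToolArgs` is an invented name): let the creative-temperature pass decide whether to call a function, then redo only the arguments at temperature 0 by forcing the same tool with `tool_choice`:

// First pass at the caller's temperature; if it chose a tool, resubmit at
// T=0 with tool_choice pinned so only the arguments are regenerated.
async function completeWithPreciseToolArgs(
  params: OpenAI.Chat.Completions.ChatCompletionCreateParamsNonStreaming,
) {
  const first = await client.chat.completions.create(params);
  const call = first.choices[0].message.tool_calls?.[0];
  if (!call || call.type !== 'function') {
    return first; // plain text: keep the creative-temperature answer as-is
  }
  return client.chat.completions.create({
    ...params,
    temperature: 0,
    tool_choice: { type: 'function', function: { name: call.function.name } },
  });
}

Pinning `tool_choice` sidesteps the first-token caveat: the decision to call the function is kept from the original pass, and only the argument tokens are regenerated deterministically.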