How to know # of tokens beforehand when I make a function calling + chat history request with Node.js

I’ve got the following code:

const completion = await openai.createChatCompletion(
      {
        model: 'gpt-3.5-turbo-0613', // a function-calling-capable model
        messages,
        stream: true,
        functions,
      },
      { responseType: 'stream' }
    );

How would I know, in advance of sending this request, that the # of tokens is going to fit within the model's limit?

I found the library npm/gpt-tokenizer, but it doesn't seem to cover the size of functions. Can I just call encode with the stringified array of functions, like encode(JSON.stringify(functions)).length? Or is there a better solution for that?

I think the simplest way would be to count the tokens in the prompt and check that count against the limit of the model you are running against. In turn, this also tells you how many tokens are left for GPT to generate in the output.

I'm not sure whether you're working in JS or Java, but the OpenAI docs recommend gpt-3-encoder - npm for JavaScript, while GitHub - openai/tiktoken: tiktoken is a fast BPE tokeniser for use with OpenAI's models is the recommendation for Python.

For reference: OpenAI Platform
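
If it helps, here's a minimal sketch of counting tokens with gpt-3-encoder. Note this package uses the older GPT-3 BPE vocabulary, so counts for newer chat models will only be approximate:

const { encode } = require('gpt-3-encoder');

const text = 'Hello, how are you?';
const tokenCount = encode(text).length; // number of BPE tokens in the string
console.log(tokenCount);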

Sorry, but it doesn’t answer my question.

Yes, it does.
tiktoken lets you take a text and count the number of tokens in it.
You should do this with the sum of all your prompts (and their roles).
That gives you the total number of input tokens.
The number of tokens the model can then generate is (model_context_size - input_tokens).
This way, you will know whether the request will fit or not.
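
As a rough sketch in Node with gpt-tokenizer, reusing the messages array from the original snippet. The per-message overhead constants follow the estimates in OpenAI's cookbook for gpt-3.5-turbo-0613 and may differ for other models, and the 4096 context size is an assumption:

const { encode } = require('gpt-tokenizer');

// Approximate ChatML framing cost: ~4 tokens per message plus ~3 tokens
// that prime the assistant's reply. These constants are estimates.
const TOKENS_PER_MESSAGE = 4;
const REPLY_PRIMING = 3;

function countInputTokens(messages) {
  let total = REPLY_PRIMING;
  for (const message of messages) {
    total += TOKENS_PER_MESSAGE;
    total += encode(message.role).length;
    total += encode(message.content ?? '').length;
  }
  return total;
}

const MODEL_CONTEXT_SIZE = 4096; // assumed limit for gpt-3.5-turbo
const inputTokens = countInputTokens(messages);
const remainingForCompletion = MODEL_CONTEXT_SIZE - inputTokens;
if (remainingForCompletion <= 0) {
  // Trim the oldest chat history messages until the request fits.
}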

Getting the exact token count is going to be difficult because we have no clue how they’re embedding functions into the prompt at this point. I would hope they’re not just shoving the JSON schema in there as that would be a massive waste of tokens.

I’m using gpt-tokenizer in my projects as well and your current approach is probably a reasonable approximation of the added token overhead.

I haven't tried this yet, but you might try counting input tokens and comparing that to what you get back in the usage section of the response.
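
Something like this, assuming the v3 openai Node SDK from the original snippet and the countInputTokens helper sketched above. Note that with stream: true the API doesn't return a usage section, so this check uses a non-streaming call:

const { encode } = require('gpt-tokenizer');

// Non-streaming call so the response body includes a usage section.
const res = await openai.createChatCompletion({
  model: 'gpt-3.5-turbo-0613', // assumed model
  messages,
  functions,
});

const actual = res.data.usage.prompt_tokens; // what the API actually counted
const estimated =
  countInputTokens(messages) + encode(JSON.stringify(functions)).length;

console.log({ estimated, actual, drift: actual - estimated });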

So actually someone posted a better approach here:
