When we use streaming with OpenAI models, I am not getting the token count.

Is this by design, or is there a way to get the token count? And how do I calculate the token cost if streaming does not return the total token count?

Yes, this behavior is documented: a streamed response does not include a usage field, and the only closing signal you get is a finish_reason in the last delta.
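For illustration, the final chunk of a stream looks roughly like this (legacy API shape, abbreviated; extra fields omitted):

# the closing chunk: an empty delta, a finish_reason, and no "usage" field
last_chunk = {
    "object": "chat.completion.chunk",
    "choices": [{"index": 0, "delta": {}, "finish_reason": "stop"}],
}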

For counting tokens yourself, you can record and append the deltas as they arrive to reconstruct the full response. Then you can use OpenAI's tiktoken library to count the tokens in that text.
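For example, a minimal counting sketch (assuming the tiktoken package is installed; encoding_for_model picks the tokenizer that matches the model):

import tiktoken

encoding = tiktoken.encoding_for_model("gpt-3.5-turbo")

def count_tokens(text: str) -> int:
    # encode() turns the text into a list of token IDs
    return len(encoding.encode(text))

print(count_tokens("Hello! How can I help you today?"))

Note that billed prompt tokens also include a few tokens of chat-format overhead per message, so a raw text count will run slightly under the usage figure.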

To verify that your calculation and the accounting agree, compare your counts for both the sent input and the received output against the daily "usage" record of the same exchange on the account management web page (keep the request isolated by ten minutes from other queries so it shows up as an individual record).
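Once you have the prompt and completion counts, the cost is just counts times the per-token rates; a sketch with placeholder per-1K prices (these numbers are illustrative only, check the current pricing page):

# Illustrative rates only -- replace with the model's current pricing.
PRICE_PER_1K_INPUT = 0.0015   # USD per 1K prompt tokens (assumed)
PRICE_PER_1K_OUTPUT = 0.002   # USD per 1K completion tokens (assumed)

def cost_usd(prompt_tokens: int, completion_tokens: int) -> float:
    return (prompt_tokens / 1000) * PRICE_PER_1K_INPUT \
        + (completion_tokens / 1000) * PRICE_PER_1K_OUTPUT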

Example of a simple looping chatbot:

import openai
openai.api_key = "sk-xxxxx"
system = [{"role": "system", "content": "You are a helpful AI assistant."}]
user = [{"role": "user", "content": "Introduce yourself."}]
chat = []
while user[0]['content'] != "exit":
    response = openai.ChatCompletion.create(
        messages = system + chat[-10:] + user,
        model="gpt-3.5-turbo", stream=True)
    reply = ""
    for delta in response:
        if not delta['choices'][0]['finish_reason']:
            # the first chunk may carry only the role, so default to ""
            word = delta['choices'][0]['delta'].get('content', '')
            reply += word  # append the deltas to record the whole response
            print(word, end="", flush=True)

    # Here you can use tiktoken to count the tokens of everything sent:
    # the contents of system, chat[-10:], and user, plus the chat format's
    # per-message overhead (a helper for this is sketched after the loop),
    # and of the "reply" variable, which holds the bare response text.

    chat += user + [{"role": "assistant", "content": reply}]
    user = [{"role": "user", "content": input("\nPrompt: ")}]
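As a sketch of the "function that understands the formatted messages" idea, here is a message counter modeled on OpenAI's cookbook example; the per-message overhead values are approximations and can differ between model snapshots:

import tiktoken

def num_tokens_from_messages(messages, model="gpt-3.5-turbo"):
    # Approximation following OpenAI's cookbook: each message carries a few
    # tokens of chat-format overhead on top of its encoded text.
    encoding = tiktoken.encoding_for_model(model)
    tokens_per_message = 4  # assumed overhead per message; varies by snapshot
    num_tokens = 0
    for message in messages:
        num_tokens += tokens_per_message
        for value in message.values():  # encodes both the role and the content
            num_tokens += len(encoding.encode(value))
    num_tokens += 3  # the reply is primed with <|start|>assistant<|message|>
    return num_tokens

# e.g. num_tokens_from_messages(system + chat[-10:] + user)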