I am working on building my own chatbot on top of my own data. I already have it working, but I was surprised by how many tokens were counted. Here is a real example:
Prompt: “y qué es una estructura de precio?” (“and what is a price structure?”)
Completion: “Una estructura de precio se refiere a una horquilla de ofertas de compra y ofertas de venta que hay en un activo financiero, que condiciona la formación de precios de los mercados financieros. Los precios se determinan por la oferta y demanda de un activo, y por los conflictos entre compradores y vendedores.” (“A price structure refers to the range of bid and ask offers on a financial asset, which conditions price formation in financial markets. Prices are determined by the supply and demand for an asset, and by the tension between buyers and sellers.”)
Here is how OpenAI counted those tokens in the usage dashboard:
19:10 (local time: 15 May 2023, 21:10) — text-davinci, 2 requests: 1,673 prompt + 112 completion = 1,785 tokens
19:10 (local time: 15 May 2023, 21:10) — text-embedding-ada-002-v2, 1 request: 9 prompt + 0 completion = 9 tokens
During those minutes, ONLY the query described above was made.
I am passing a chat history so the bot can follow the ongoing conversation. Here is the code:
@bot.message_handler(func=lambda msg: True)
def echo_all(message):
    global qa, chat_history
    try:
        # Run the chain with the incoming question and the stored history
        result = qa({"question": message.text, "chat_history": chat_history})
        # Keep the last (question, answer) pair as history; note this stores
        # only one turn and should use message.text, not the message object
        chat_history = [(message.text, result["answer"])]
        bot.reply_to(message, result["answer"])
    except Exception:
        bot.reply_to(message, "Actívame pulsando /start.")
Actually, I only interact with OpenAI to fill the variable “result”. That single call apparently results in 2 requests to davinci and 1 request to embedding ada. Can someone explain how this accounting works?
Thanks!
Ramon.