Limiting LLM usage per user based on token cost

nameuser · July 25, 2024, 4:46pm

I have developed a tool using OpenAI's gpt-3.5-turbo-1106, integrated with LangSmith and LangGraph. I want to implement a feature that limits usage of my bot based on a cost threshold for OpenAI API calls. Specifically, I want to let users make free requests until they reach a $5 monthly limit, similar to how ChatGPT limits free users' requests to the GPT-4o model.

I'm using LangGraph:

# For final response
from langchain_core.messages import HumanMessage

inputs = {
    "messages": [
        HumanMessage(
            content="Can you delete the user with email mike@gmail.com."
        )
    ]
}

# Use the Runnable to get the final state
final_state = app.invoke(
    inputs,
    config={"configurable": {"thread_id": 42}}
)

# Print the final response
print(final_state)

The final_state doesn't provide any information about tokens.
I tried using the get_openai_callback function, but it didn't work either.
How do I find the token usage per API call so that I can restrict users once they reach their limit?
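
To make the goal concrete, here is a rough sketch of the bookkeeping I have in mind. It assumes each AI message in final_state["messages"] carries a usage_metadata dict with input_tokens and output_tokens, which recent langchain-core versions attach to AIMessage but which may be missing depending on version and model setup. The per-token rates, the in-memory spend store, and the helper names (tokens_in_state, cost_usd, can_call, record_usage) are all hypothetical placeholders, not part of any library.

```python
from collections import defaultdict
from datetime import datetime, timezone

# Assumed placeholder rates in USD per 1,000 tokens; check current pricing.
PRICE_PER_1K = {"input": 0.001, "output": 0.002}
MONTHLY_LIMIT_USD = 5.00

# Hypothetical in-memory store keyed by (user_id, "YYYY-MM").
# A real deployment would persist this in a database.
spend = defaultdict(float)


def month_key(now=None):
    """Return the billing bucket, e.g. '2024-07', for the given UTC time."""
    now = now or datetime.now(timezone.utc)
    return now.strftime("%Y-%m")


def tokens_in_state(messages):
    """Sum token counts over messages that carry a usage_metadata dict."""
    total_in = total_out = 0
    for msg in messages:
        # Human/tool messages have no usage info; AI messages may have None.
        usage = getattr(msg, "usage_metadata", None) or {}
        total_in += usage.get("input_tokens", 0)
        total_out += usage.get("output_tokens", 0)
    return total_in, total_out


def cost_usd(input_tokens, output_tokens):
    """Convert token counts to an estimated dollar cost."""
    return (input_tokens * PRICE_PER_1K["input"]
            + output_tokens * PRICE_PER_1K["output"]) / 1000


def can_call(user_id, now=None):
    """True while the user's spend this month is below the limit."""
    return spend[(user_id, month_key(now))] < MONTHLY_LIMIT_USD


def record_usage(user_id, messages, now=None):
    """Add the estimated cost of these messages to the user's monthly spend."""
    input_tokens, output_tokens = tokens_in_state(messages)
    cost = cost_usd(input_tokens, output_tokens)
    spend[(user_id, month_key(now))] += cost
    return cost
```

The idea would be to check can_call(user_id) before app.invoke(...) and then call record_usage(user_id, new_messages) afterwards. Since a thread_id keeps earlier turns in the state, only the messages added by the current call should be passed in, otherwise earlier turns get billed again.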
