Issues with High Token Usage in Assistants API for Chatbot Responses

I've built a chatbot that answers customer questions on behalf of our company. We have a large set of questions and answers related to a ski resort. However, there's one issue, and I need some help.

I'm using the Assistants API and have uploaded a file with the questions and answers. When I ask these questions in various forms, the responses are good. But I noticed in the Playground that a single response uses approximately 10,000 tokens.

Could you please advise if I am doing something incorrectly and provide some tips on how to optimize this?


Hey, I saw your post and just wanted to drop a quick thought here – maybe it’ll help.

You're definitely not doing anything "wrong" per se – the retrieval tool just tends to inject a lot of context when you work with large files. If you've uploaded your full Q&A dataset as a single file, every response may be pulling in chunks from the whole thing, on top of your instructions and conversation history. That's what eats up tokens fast.
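If I remember right, the v2 Assistants API also lets you cap how many retrieved chunks get injected per run, and how much thread history is carried along. I'm writing the exact parameter names from memory, so verify them against the current OpenAI docs – here's the rough shape of the options as a plain dict, so no live API call is needed to see it:

```python
# Hedged sketch: per-run options I believe the v2 Assistants API accepts
# (double-check the exact names in the current OpenAI docs). Built as a
# plain dict so the payload shape is visible without an API call.
run_options = {
    # Only carry the most recent messages of the thread into the run.
    "truncation_strategy": {
        "type": "last_messages",
        "last_messages": 4,
    },
    # Cap how many retrieved chunks file_search injects per response.
    "tools": [{
        "type": "file_search",
        "file_search": {"max_num_results": 5},
    }],
}

# You'd pass these as keyword arguments to
# client.beta.threads.runs.create(...) alongside thread_id/assistant_id.
print(run_options["truncation_strategy"]["last_messages"])
```

Lower values for both knobs mean fewer tokens per response, at the cost of the assistant seeing less context.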

What worked for me was breaking the data down into smaller topic-specific files (e.g., “Opening Hours,” “Parking Info”), and only referencing what’s needed at the moment.
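To make that concrete, here's a minimal sketch of the routing idea: pick the topic file before the request goes out, so only the relevant file is attached. The topic names and keywords below are made-up examples, not anything from your dataset:

```python
import re

# Hypothetical topic files mapped to trigger keywords. In practice
# these would be the file IDs you uploaded per topic.
TOPIC_FILES = {
    "opening_hours.txt": ["open", "hours", "close", "closing"],
    "parking_info.txt": ["parking", "park", "car", "garage"],
}

def pick_topic_file(question: str, default: str = "general_faq.txt") -> str:
    """Return the Q&A file whose keywords best match the question."""
    words = re.findall(r"[a-z]+", question.lower())
    best_file, best_hits = default, 0
    for filename, keywords in TOPIC_FILES.items():
        hits = sum(1 for kw in keywords if kw in words)
        if hits > best_hits:
            best_file, best_hits = filename, hits
    return best_file

print(pick_topic_file("What time do the lifts open?"))  # opening_hours.txt
```

A plain keyword match like this is crude, but even a rough router keeps the retrieval tool from dragging the whole dataset into every response.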

Also worth looking at is limiting how much previous context you carry forward. You don’t always need the full history – just enough for the assistant to stay coherent.
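If you manage the history yourself, trimming it is a few lines. This sketch assumes messages stored as `{"role": ..., "content": ...}` dicts (the common Chat Completions shape – adapt it to however you store yours):

```python
# Sketch: keep the system message (if any) plus only the last few
# turns, so each request carries a bounded amount of history.
def trim_history(messages, keep_last=4):
    """Return system messages plus the last `keep_last` other messages."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-keep_last:]

history = [
    {"role": "system", "content": "You answer ski resort questions."},
    {"role": "user", "content": "When do lifts open?"},
    {"role": "assistant", "content": "At 8:30."},
    {"role": "user", "content": "And on weekends?"},
    {"role": "assistant", "content": "At 8:00."},
    {"role": "user", "content": "Is parking free?"},
]
trimmed = trim_history(history, keep_last=2)  # system + 2 latest turns
```

Tune `keep_last` to the smallest value that still keeps the assistant coherent for your conversations.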

I’m building a more modular chatbot system myself (nothing fancy, just structure-focused), so if you’re curious to compare notes, happy to share.
Either way – your project sounds like it’s already got good bones.

Keep it up!