Hello, I’m building an MVP with Assistants.
I created a voice assistant for a specific purpose.
I have one text file uploaded, which is 77k characters (11,400 words) long and is stored in a vector store.
My input (instruction) prompt is around 500 tokens, but whenever the assistant needs to search this text file for information it spends 17k to 20k tokens.
Is this normal or do I have a terrible leak somewhere?
P.S. Using GPT-4 Turbo.
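In case it helps, here’s roughly how the assistant is set up (simplified sketch; the file path, vector store name, and instruction text are placeholders):

```python
from openai import OpenAI

client = OpenAI()

# Upload the text file, then attach it to a vector store
uploaded = client.files.create(file=open("my_document.txt", "rb"), purpose="assistants")
vector_store = client.beta.vector_stores.create(name="voice-assistant-docs")
client.beta.vector_stores.files.create(vector_store_id=vector_store.id, file_id=uploaded.id)

# The assistant uses file_search against that vector store
assistant = client.beta.assistants.create(
    model="gpt-4-turbo",
    instructions="(my ~500-token instruction prompt goes here)",
    tools=[{"type": "file_search"}],
    tool_resources={"file_search": {"vector_store_ids": [vector_store.id]}},
)
```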
The built-in retrieval mechanism is very greedy and it will grab anything and everything it thinks might be even remotely relevant.
At this point, this behaviour should be expected; there really isn’t anything you can do to mitigate it short of going through and trimming the fat from your uploaded document, in the hope that there will be fewer tangentially related chunks available for it to ingest.
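If you want to confirm that the retrieved chunks are what’s eating the tokens, you can look at a run’s reported usage once it completes (rough sketch with the Python SDK; the thread and run IDs are placeholders):

```python
from openai import OpenAI

client = OpenAI()

# usage is only populated after the run has completed
run = client.beta.threads.runs.retrieve(run_id="run_abc123", thread_id="thread_abc123")

# prompt_tokens covers your instructions, the conversation so far, and every
# retrieved chunk the file search stuffed into the context
print("prompt tokens:    ", run.usage.prompt_tokens)
print("completion tokens:", run.usage.completion_tokens)
print("total tokens:     ", run.usage.total_tokens)
```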
It’s a bit counter-intuitive, and goes against the grain of embedding documents, to go from text → PDF, especially if you just saved it without performing any work on it. I would actually say that there’s something wrong here.
Do you mind sharing this document? Going from 17k → 1k tokens is quite an accomplishment if the results are just as accurate.
Or, at the least, what kind of document was it? I could maybe see a tabular document performing better if the text isn’t baked into the document and is better read row by row.
Idk if my math is right here, but 1 word is usually around 1.33 tokens. If your document is 11,400 words, the complete document works out to ~15,000 tokens, so a 17-20k-token search suggests retrieval is pulling in most, if not all, of it.
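If you want an exact count instead of the 1.33 rule of thumb, you can measure it with tiktoken (assuming the file is plain text; the path is a placeholder):

```python
# pip install tiktoken
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # tokenizer used by GPT-4 Turbo

with open("my_document.txt", encoding="utf-8") as f:
    text = f.read()

print(f"{len(text):,} characters -> ~{len(enc.encode(text)):,} tokens")
```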