How are tokens for a PDF calculated?

manuelmaccou · January 19, 2024, 4:18pm

When using the API to send a PDF file as context to answer a question, how is the token count calculated? Does it identify the relevant text first and then apply the standard token count for that text against the context window? Or does it handle PDFs in some more specific way?

Foxalabs · January 19, 2024, 5:06pm

The exact methods the Assistants API uses for retrieval are not documented, so there is no way to properly answer this question. You should assume that the AI and its subsystems will, at their discretion, either perform a RAG retrieval or load the full document (if the text fits within the 128k context limit). The Assistants API is a beta system that is still under development, so expect this to be updated in due course.
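Since the real method is undocumented, the most you can do on your side is a rough check of whether the text you have already extracted from the PDF could plausibly fit in the 128k window. Here is a minimal sketch of that check. It is not how the Assistants API counts tokens: the 4-characters-per-token ratio is only a common rule of thumb for English text, and the function names are made up for illustration.

```python
import math

CONTEXT_LIMIT_TOKENS = 128_000  # the 128k context limit mentioned above
CHARS_PER_TOKEN = 4  # assumption: rough rule of thumb for English text


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count (not a real tokenizer)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def likely_fits_in_context(text: str, limit: int = CONTEXT_LIMIT_TOKENS) -> bool:
    """True if the estimated token count is within the given limit."""
    return estimate_tokens(text) <= limit


if __name__ == "__main__":
    # Placeholder string standing in for text extracted from a PDF.
    extracted_text = "Example text extracted from a PDF. " * 1000
    print(estimate_tokens(extracted_text))
    print(likely_fits_in_context(extracted_text))
```

For an exact count of the text itself you would run it through the model's actual tokenizer, but even that tells you nothing about what the retrieval step decides to load.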

