Detecting When a PDF Should Use File Search vs Full Context

It seems that when we submit PDFs to ChatGPT, it has two modes:

  1. If the PDF is very large, it gets chunked and the LLM receives a `file_search` tool for retrieval.
  2. If the PDF is small, the entire content gets stuffed into the message context.

I want to achieve the same result using the Responses API. However, it is hard for us to estimate how large (how many tokens) a PDF is, because that would require extracting the text and the images ourselves.

Has anyone implemented this? Any recommendations?

https://platform.openai.com/docs/guides/pdf-files

Note: the Supported models section needs updating.

I read that before posting, obviously.

You can try using the input token count endpoint.

It will return something like:
`InputTokenCountResponse(input_tokens=402183, object='response.input_tokens')`

It will give you an estimate of how many input tokens will be consumed for the given prompt.

Then you can decide whether to put the PDF into a vector store (or handle it some other way) if the count is above a certain threshold.
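
For what it's worth, here is a minimal sketch of that flow in Python. Treat the `client.responses.input_tokens.count(...)` accessor, the model name, the 100k threshold, and the assumption that the count endpoint accepts file inputs as things to verify against your SDK version and model's context window:

```python
from openai import OpenAI

client = OpenAI()

# Assumed threshold -- tune it to your model's context window.
TOKEN_THRESHOLD = 100_000

# Upload the PDF once; the same file id works for counting, inline input,
# and vector store indexing.
pdf = client.files.create(file=open("report.pdf", "rb"), purpose="user_data")

pdf_input = [{
    "role": "user",
    "content": [
        {"type": "input_file", "file_id": pdf.id},
        {"type": "input_text", "text": "Summarize this document."},
    ],
}]

# Dry-run token count; no response is created.
count = client.responses.input_tokens.count(model="gpt-4.1", input=pdf_input)

if count.input_tokens > TOKEN_THRESHOLD:
    # Too large for full context: index the PDF in a vector store and let
    # the model retrieve chunks via the file_search tool.
    store = client.vector_stores.create(name="pdf-store")
    client.vector_stores.files.create_and_poll(
        vector_store_id=store.id, file_id=pdf.id
    )
    response = client.responses.create(
        model="gpt-4.1",
        input="Summarize this document.",
        tools=[{"type": "file_search", "vector_store_ids": [store.id]}],
    )
else:
    # Small enough: send the whole PDF inline.
    response = client.responses.create(model="gpt-4.1", input=pdf_input)

print(response.output_text)
```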


That seems like a promising path, but I have a couple of questions about how it works:

If I pass the `conversation` argument, the input is added to the conversation, so I can no longer decide not to add the PDF to it?

If I don't pass the `conversation` argument, does that endpoint charge me for the input tokens, so I'd pay again when I do decide to send the PDF? Or is that endpoint free?

It is free, as far as I know (as long as there is no abuse).

It won't affect the conversation; it is essentially a request simulation. The parameters are basically the same as for a response request, and you can measure other types of input too.
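
As a tiny illustration (the model name and the `input_tokens.count` accessor are the same assumptions as in the sketch above, and I'm assuming the count endpoint mirrors `responses.create` parameters as described):

```python
from openai import OpenAI

client = OpenAI()

# Build the request once; the count endpoint takes essentially the same
# parameters as responses.create.
params = dict(
    model="gpt-4.1",
    input="How many tokens is this sentence?",
)

# Simulation only: nothing is added to any conversation.
count = client.responses.input_tokens.count(**params)
print(count.input_tokens)

# If the size is acceptable, reuse the exact same params for the real call.
response = client.responses.create(**params)
print(response.output_text)
```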


Awesome. Will use this. Thanks!
