How to reduce token usage from repeating the system prompt for each item in the Batch API

Howdy! I was just reading through this documentation about the Batch API (https://platform.openai.com/docs/guides/batch).

Is there a way for me to send the system instructions to OpenAI just once (assuming they are the same for each item in the batch) so that I can avoid using tokens that will simply be duplicates?

This may be what you are looking for:

https://platform.openai.com/docs/guides/prompt-caching
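To take advantage of prompt caching, the key is that caching matches on identical prompt *prefixes*, so the shared system message should be byte-for-byte the same in every request, with the variable content after it. Here is a minimal sketch of building a Batch API input file that way; the model name, prompt text, and ticket data are placeholders, not from the docs:

```python
import json

# Shared system prompt (placeholder text). Keeping it identical across
# requests, and first in the messages list, gives every request the same
# prefix -- the pattern prompt caching can match on.
SYSTEM_PROMPT = "You are a helpful assistant that classifies support tickets."

def build_batch_line(custom_id: str, user_text: str) -> str:
    """Build one JSONL line in the Batch API request format."""
    request = {
        "custom_id": custom_id,
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4o-mini",  # placeholder model name
            "messages": [
                {"role": "system", "content": SYSTEM_PROMPT},  # identical prefix
                {"role": "user", "content": user_text},        # variable suffix
            ],
        },
    }
    return json.dumps(request)

# Hypothetical batch items.
tickets = ["My order never arrived.", "How do I reset my password?"]
lines = [build_batch_line(f"ticket-{i}", t) for i, t in enumerate(tickets)]

with open("batch_input.jsonl", "w") as f:
    f.write("\n".join(lines) + "\n")
```

Note that caching only kicks in for prompts above a minimum length (the guide says 1024 tokens), so a short system prompt like the placeholder above would not actually be cached.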


Sweet, this is what I was looking for, thanks!


Note that the Batch API:

  • runs on OpenAI’s schedule
  • likely processes requests in parallel across many instances
  • is already discounted

So essentially, even if OpenAI internally optimizes for system prompts and input it has seen before, you are not offered any additional “cache hit” discount on top of the batch discount.

The full prompt text is always required for the model to understand each request, so it must be included every time.