Prompt Caching in Batching API

likhitha · April 6, 2025, 8:39am

I’m using GPT-4o for image analysis, and my prompt is quite large (approximately 3,000 tokens). Can I cache this prompt and use it in Batch deployment mode rather than sending the same prompt against each task in the input file?

sps · April 6, 2025, 9:14am

Hi @likhitha,

AFAIK, caching isn’t available for batch API requests as of writing this post.

In the batch input file, every line, which must be a valid JSON object request, is considered a standalone request. Hence, each request must be treated as stateless and contain the prompt and context needed for that individual request to work as desired.

_j · April 6, 2025, 10:17am

Additionally, the benefit to you for activating context window caching on the API currently is a 50% discount on input lengths that match a server’s cache of a similar input.

Not any ability to not need to re-send.

The batch API already has a 50% discount on everything.

The clever OpenAI could identify commonality and run optimized batches behind the scenes, but that is not exposed to you.

Topic		Replies	Views
Batch API - System Prompt Caching - Is it possilbe to cache system prompt from single batch job and reuse it across multiple batches? API batch-api	2	600	June 11, 2025
Can Batch api work with prompt caching? API batch-api	4	1816	December 6, 2024
How to reduce token usage by repeating system prompt each time for batch API API	3	305	October 25, 2025
Batch API vs Prompt caching API batch-api , prompt-caching	1	1174	October 14, 2024
Prompt caching with multiple agents API	1	1155	October 9, 2024

Prompt Caching in Batching API

Related topics