Let’s say I create a batch where every request has the same system prompt of more than 1,024 tokens. Will prompt caching work the same way it does for normal chat completions?
Hi @thanawarat.jongchiew! Yes it will, and in fact it’s probably the most effective way of utilizing caching, since the requests in a batch are executed back-to-back.
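To illustrate, here’s a minimal sketch of how such a batch input file could be put together so every request shares the same long system prompt (the cacheable prefix). It assumes the standard Batch API JSONL format and the official Python SDK; the model name, file path, and `user_inputs` list are placeholders.

```python
import json
from openai import OpenAI

client = OpenAI()

# Placeholder: a static system prompt that exceeds the ~1,024-token caching threshold.
LONG_SYSTEM_PROMPT = "You are an extraction assistant. ..."  # >1,024 tokens in practice

# Placeholder inputs to process in the batch.
user_inputs = ["document 1 text ...", "document 2 text ...", "document 3 text ..."]

# Build the batch input JSONL: every request repeats the same system prompt,
# so the shared prefix is what prompt caching can reuse across requests.
with open("batch_input.jsonl", "w") as f:
    for i, text in enumerate(user_inputs):
        line = {
            "custom_id": f"request-{i}",
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": "gpt-4o-mini",  # placeholder model
                "messages": [
                    {"role": "system", "content": LONG_SYSTEM_PROMPT},
                    {"role": "user", "content": text},
                ],
            },
        }
        f.write(json.dumps(line) + "\n")

# Upload the file and create the batch job.
batch_file = client.files.create(file=open("batch_input.jsonl", "rb"), purpose="batch")
batch = client.batches.create(
    input_file_id=batch_file.id,
    endpoint="/v1/chat/completions",
    completion_window="24h",
)
print(batch.id, batch.status)
```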
Does that mean I can reduce cost by 4×? 50% from batching and 50% from caching?
If you look at the pricing page, there is a specific price listed for cached input and a separate input price listed for batch:
https://openai.com/api/pricing/
They are not combined, and there is no “cached” pricing tier under batch.
Even if caching does reduce computation in a batch, there is no indication that those savings would be passed along. Caching also relies on requests being routed to the same API server destination within a time window, which batch processing may not do or have any awareness of.
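To make the arithmetic explicit (with a placeholder base rate, purely for illustration): a 4× reduction would require the two 50% discounts to stack, and that combined tier is not what the pricing page lists.

```python
base = 1.00                  # placeholder $ per 1M input tokens
batch_input = 0.5 * base     # batch input rate (listed)
cached_input = 0.5 * base    # cached input rate (listed)
stacked = 0.5 * 0.5 * base   # hypothetical combined tier = 4x reduction (not listed)
print(batch_input, cached_input, stacked)  # 0.5 0.5 0.25
```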
I wouldn’t put any hard numbers or percentages on the savings, but to give you an idea: in the last batch jobs we ran, almost exactly 50% of our input tokens were cached. Take that number with a grain of salt, but that’s how it turned out in our case.

So yes, where it makes sense, I would opt for doing a batch job. Obviously this doesn’t fit every use case, but if you’re extracting or parsing information from a large amount of data, outputting it in structured/JSON form, and you have a decently large static system prompt, this approach will give you the maximum savings.
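For what it’s worth, you can check this yourself once a batch completes: each line of the output file carries a usage object, and (assuming the current response shape) `usage.prompt_tokens_details.cached_tokens` reports how many input tokens were served from cache. A rough sketch, with the batch ID as a placeholder:

```python
import json
from openai import OpenAI

client = OpenAI()

# Placeholder: the ID of a completed batch job.
batch = client.batches.retrieve("batch_abc123")
output = client.files.content(batch.output_file_id)

total_prompt = 0
total_cached = 0
for line in output.text.splitlines():
    result = json.loads(line)
    usage = result["response"]["body"]["usage"]
    total_prompt += usage["prompt_tokens"]
    # cached_tokens may be absent on some responses; treat missing as 0
    total_cached += usage.get("prompt_tokens_details", {}).get("cached_tokens", 0)

print(f"Cached input tokens: {total_cached}/{total_prompt} "
      f"({100 * total_cached / max(total_prompt, 1):.1f}%)")
```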