Hello everyone,
I want to report something I've been testing over the last few days, because it may be helpful for other devs too.
I ran a big batch of tests (around 30 runs) using different models: gpt-5.1, gpt-5-nano, and gpt-5-mini.
Right now, only GPT-5.1 is showing consistent caching behavior. With GPT-5-mini and GPT-5-nano, the cache almost never hits.
For example:
I sent 20 requests with the exact same system input, and only 2 of them got cached successfully.
The rest came back with cached_tokens = 0, as if the model didn't even try to use the cache at all.
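For anyone who wants to reproduce the measurement, here is roughly how I'm counting hits. This is a minimal sketch: the helper name and the placeholder token counts are mine, but `usage.prompt_tokens_details.cached_tokens` is the field the Chat Completions response exposes in my runs.

```python
# Summarize the cached_tokens values collected from a batch of responses.
# Each entry is response.usage.prompt_tokens_details.cached_tokens.

def cache_hit_rate(cached_token_counts):
    """Fraction of requests that reported any cached tokens at all."""
    hits = sum(1 for c in cached_token_counts if c > 0)
    return hits / len(cached_token_counts)

# The run described above: 20 identical requests, only 2 with a nonzero
# cached_tokens value (the 1024s are illustrative placeholders).
counts = [0] * 18 + [1024, 1024]
print(f"hit rate: {cache_hit_rate(counts):.0%}")  # hit rate: 10%
```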
I was expecting these smaller models to use caching a lot more, since the announcement said they should be more optimized. But in practice, something doesn't seem to be working correctly.
Not sure if this is a general bug or if it's only happening for some users. If anyone else is facing the same issue, it would be nice to know.
Also, for people who are NOT having this issue:
did you change something in your API config, or add some parameter that makes the cache hit more often?
Just trying to understand if I'm missing something in my setup.
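One thing I'm double-checking on my side, in case it helps others: from what I understand of the docs, prompt caching matches exact prompt prefixes and only activates once the prompt passes a minimum length (around 1024 tokens), so any variable content should come after the static part. A small sketch of what I mean (the names and prompt text here are just placeholders):

```python
# Keep the cacheable prefix byte-identical across requests: static system
# content first, variable content (the user question) last.
import json

STATIC_SYSTEM = "You are a support assistant. " + "Policy text... " * 100

def build_messages(user_question):
    return [
        {"role": "system", "content": STATIC_SYSTEM},  # identical every call
        {"role": "user", "content": user_question},    # variable part last
    ]

a = json.dumps(build_messages("How do I reset my password?"))
b = json.dumps(build_messages("What is your refund policy?"))

# The serialized requests share the long static prefix exactly:
print(a[:500] == b[:500])  # True
```

If the static part drifts even slightly between requests (whitespace, a timestamp in the system prompt), the prefix match fails and cached_tokens stays at 0.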
And if the @OpenAI_Support team can check this behavior, it would help a lot, because for production apps the cache is super important (especially for nano/mini, where cost and speed matter).