GPT-4o and GPT-4o-mini Cache Not Working?

Hi everyone,

Today I noticed that the cache is not working for me with both GPT-4o and GPT-4o-mini. My code is exactly the same as before. The initial prompts I’m using are long enough to trigger the cache on successive calls, and caching was working correctly before.
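One way to confirm whether the cache is actually firing is to inspect the `usage` field of the response: when a prefix is served from the cache, `usage.prompt_tokens_details.cached_tokens` is non-zero. Here is a minimal sketch of a helper that reads that field, assuming the usage payload shape documented for prompt caching (the sample numbers are made up for illustration):

```python
def cached_tokens(usage: dict) -> int:
    """Return how many prompt tokens were served from the cache (0 = cache miss)."""
    details = usage.get("prompt_tokens_details") or {}
    return details.get("cached_tokens", 0)

# Hypothetical usage payload in the shape Chat Completions returns:
usage = {
    "prompt_tokens": 2006,
    "completion_tokens": 300,
    "total_tokens": 2306,
    "prompt_tokens_details": {"cached_tokens": 1920},
}
print(cached_tokens(usage))  # 1920 -> most of the prompt was cached
```

Logging this value on successive calls makes it easy to tell a real cache regression apart from prompts that are too short or whose prefixes change between calls.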

Is anyone else experiencing the same issue? Any insights would be appreciated!

Thanks!


Hi @davidia,

Thanks for reporting.

I just tested it, and it seems to be working on gpt-4o but not on gpt-4o-mini.


Thanks for checking! I tested again, and caching now works with GPT-4o—not sure if it just started working or if I missed it before. But GPT-4o-mini is still not working.

Just tested it today (18/11/2025) and GPT-4o-mini Cached Input is still not working.

First, update the openai library to the latest version, then try this; it should work:

from openai import OpenAI

client = OpenAI()

# openai.ChatCompletion.create was removed in openai>=1.0;
# use the client-based interface instead.
response = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "static instructions..."},
        {"role": "user", "content": "Your question"},
    ],
    prompt_cache_retention="in_memory",
)

Closed because this is a topic from February 2025