| Topic | Replies | Views | Date |
|---|---|---|---|
| Switching to gpt5.4-nano results in 0% cache hit rate | 0 | 120 | April 28, 2026 |
| Inference-time systems proposal: KV-cache relay to eliminate redundant prefill across sub-agents | 0 | 156 | March 13, 2026 |
| Prompt caching not working | 10 | 1663 | March 4, 2026 |
| Caching is borked for GPT 5 models | 19 | 2849 | January 8, 2026 |
| Understanding "prompt_cache_keys" in query efficiency | 5 | 1305 | November 12, 2025 |
| We need to talk about prompt caching | 1 | 835 | October 25, 2025 |
| Can I cache large chunks on gpt-5-nano?, Does each cache-read request reset cache inactive time?, Does large caches affect cache overflow limits? | 1 | 145 | October 25, 2025 |
| Prompt Caching seems not working even if long common prefix in the system prompt | 3 | 575 | September 23, 2025 |
| Following Instructions Quality - Developer message and its position, instructions and prompt caching | 1 | 199 | September 16, 2025 |
| Prompt caching with tools | 1 | 740 | September 15, 2025 |
| New Realtime API voices and cache pricing | 26 | 13230 | September 2, 2025 |
| Does structured output schema come BEFORE/AFTER system message for prompt caching? | 3 | 322 | August 3, 2025 |
| How to use cached_tokens field to calculate cost estimation | 1 | 976 | July 31, 2025 |
| Prompt Cache Routing + the `user` Parameter | 3 | 713 | July 31, 2025 |
| Prompt caching doesn't seem to work regularly | 4 | 1035 | July 13, 2025 |
| How to improve caching accuracy | 1 | 532 | July 8, 2025 |
| System prompt not regard when using web search in OpenAI – Why? | 3 | 783 | July 6, 2025 |
| Prompt caching - how many prompts are cached? | 2 | 263 | June 20, 2025 |
| 4o input not being cached | 42 | 2225 | April 25, 2025 |
| Is there a way to disable prompt caching in the APIs | 9 | 8171 | April 24, 2025 |
| Responses API not using cached inputs for o3-mini | 0 | 135 | April 17, 2025 |
| Automatic context window caching - better performance? When? | 0 | 231 | April 3, 2025 |
| Prompt caching enabled in O3-Mini? | 5 | 371 | March 31, 2025 |
| Does prompt caching reduce TPM? | 4 | 707 | March 9, 2025 |
| How Prompt caching works? | 17 | 10458 | February 4, 2025 |
| Dashboard usage vs Prompt response usage not matching | 12 | 923 | January 9, 2025 |
| Understanding Prompt caching | 0 | 487 | January 2, 2025 |
| Does prompt caching persist between different models? | 1 | 404 | December 23, 2024 |
| Why don't we have prompt caching on gpt-4? | 1 | 220 | November 22, 2024 |
| Cache not caching more than 1024 tokens (expected: increments of 128 tokens) | 6 | 451 | November 14, 2024 |