| Topic | Replies | Views | Date |
|---|---|---|---|
| Caching is borked for GPT 5 models | 10 | 794 | October 20, 2025 |
| Prompt Caching seems not working even if long common prefix in the system prompt | 4 | 221 | September 23, 2025 |
| Following Instructions Quality - Developer message and its position, instructions and prompt caching | 1 | 98 | September 16, 2025 |
| Prompt caching with tools | 1 | 172 | September 15, 2025 |
| Understanding "prompt_cache_keys" in query efficiency | 2 | 213 | September 11, 2025 |
| New Realtime API voices and cache pricing | 27 | 11045 | September 2, 2025 |
| Does structured output schema come BEFORE/AFTER system message for prompt caching? | 3 | 123 | August 3, 2025 |
| How to use cached_tokens field to calculate cost estimation | 1 | 330 | July 31, 2025 |
| Prompt Cache Routing + the `user` Parameter | 3 | 448 | July 31, 2025 |
| Prompt caching doesn't seem to work regularly | 4 | 505 | July 13, 2025 |
| How to improve caching accuracy | 1 | 189 | July 8, 2025 |
| System prompt not regard when using web search in OpenAI – Why? | 3 | 389 | July 6, 2025 |
| Prompt caching - how many prompts are cached? | 2 | 148 | June 20, 2025 |
| 4o input not being cached | 42 | 1639 | April 25, 2025 |
| Is there a way to disable prompt caching in the APIs | 9 | 6389 | April 24, 2025 |
| Responses API not using cached inputs for o3-mini | 0 | 99 | April 17, 2025 |
| Automatic context window caching - better performance? When? | 0 | 139 | April 3, 2025 |
| Prompt caching enabled in O3-Mini? | 5 | 298 | March 31, 2025 |
| Does prompt caching reduce TPM? | 4 | 363 | March 9, 2025 |
| How Prompt caching works? | 17 | 8527 | February 4, 2025 |
| Dashboard usage vs Prompt response usage not matching | 13 | 620 | January 9, 2025 |
| Understanding Prompt caching | 0 | 346 | January 2, 2025 |
| Does prompt caching persist between different models? | 1 | 200 | December 23, 2024 |
| Why don't we have prompt caching on gpt-4? | 1 | 155 | November 22, 2024 |
| Cache not caching more than 1024 tokens (expected: increments of 128 tokens) | 6 | 286 | November 14, 2024 |
| Gpt-4o-2024-08-06 randomly fails to cache tokens | 7 | 180 | November 12, 2024 |
| Improving Cache Management: Handling Tool Removal in Active Conversations | 0 | 39 | November 11, 2024 |
| Prompt caching not working | 9 | 1246 | November 2, 2024 |
| How does Prompt Caching work? | 8 | 6375 | October 29, 2024 |
| Regarding the Issue of Half-Priced Prompt Caching | 5 | 753 | October 25, 2024 |