How does the Prompt Caching prefix match work?

It is still not clear to me from the documentation how prompt caching matching works.

Scenario 1: I have prompts of the form system_prompt + user_1_message, system_prompt + user_2_message, system_prompt + user_3_message.
Assume my system prompt is 980 tokens and my user messages are about 400 tokens each.
In that case, caching will kick in because each prompt has more than 1024 tokens, but will anything actually be cached, given that fewer than 1024 tokens are common between the prompts?

The documentation also mentions that it matches a prefix. Up to what length is the prefix matched?

Scenario 2: Suppose I have two system prompts, system_prompt_1 and system_prompt_2, both of which are 1200 tokens and have the first 600 tokens exactly the same.

  1. Will anything be cached in this case?
  2. If yes, would there be two entries in the cache? And when the cache is looked up, what prefix length would be matched?

I would like to know these details so that we can build accordingly.

Hi yashwantk,

Welcome to the forum :slight_smile:

Scenario 1: You must have at least 1024 consecutive identical tokens, so no

UNLESS, with 980 + 400, the first 44 tokens of each user_message are the same (980 + 44 = 1024)

Scenario 2: You must have at least 1024 consecutive identical tokens, so no
(cache miss at token 601 of the system prompt)

Also, it is an exact matching prefix, so if the first character is different and all the rest are the same, you still have a miss.

It is system + user combined, i.e. the first 1024 tokens of the full prompt, not of each part separately.
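To make the rule concrete, here is a minimal sketch of the matching logic as described above (this is a conceptual model, not OpenAI's actual implementation; the 1024-token minimum is from the docs, and the helper names are my own):

```python
# Conceptual model of the prompt-caching prefix rule:
# a cache hit requires an EXACT token-level prefix match
# of at least MIN_CACHE_TOKENS tokens across the full prompt
# (system + user combined).

MIN_CACHE_TOKENS = 1024

def common_prefix_len(a, b):
    """Length of the exact common prefix of two token sequences."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n

def cached_prefix_tokens(prev_prompt, new_prompt):
    """Tokens reusable from cache: the exact common prefix if it
    reaches the minimum, otherwise 0 (a cache miss)."""
    n = common_prefix_len(prev_prompt, new_prompt)
    return n if n >= MIN_CACHE_TOKENS else 0

# Scenario 1: 980 shared system tokens + 400 differing user tokens.
system = ["s"] * 980
p1 = system + ["u1"] * 400
p2 = system + ["u2"] * 400
print(cached_prefix_tokens(p1, p2))  # 0 -- only 980 tokens match

# Scenario 2: two 1200-token system prompts sharing the first 600.
sp1 = ["a"] * 600 + ["b"] * 600
sp2 = ["a"] * 600 + ["c"] * 600
print(cached_prefix_tokens(sp1, sp2))  # 0 -- mismatch at token 601

# Hit case: an identical prompt matches its full length (>= 1024).
print(cached_prefix_tokens(p1, p1))  # 1380
```

In both of your scenarios the common prefix falls short of 1024 tokens, so nothing is served from cache, which is exactly the "so no" in the answers above.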

“Cache hits are only possible for exact prefix matches within a prompt.”
https://platform.openai.com/docs/guides/prompt-caching