Hello, I have a startup which is basically a ChatGPT wrapper plus a very clever system prompt.
The problem is that for every user request I send the same system prompt, which is about 1,500 input tokens, plus another 500 to 1,000 input tokens from my users that change on every request (and therefore cannot be cached).
Is there any way I can cache my system prompt so my startup can be more profitable? The profit margins are very bad right now, since OpenAI API costs are 50% of my revenue, and I really need to use gpt-4o-latest since it's the only model that gives me great results.
Are you inserting any dynamic content at the beginning or in the middle of your system prompt? For example, dates/times, usernames, etc.? For caching to take effect, at least the first 1,024 tokens of the prompt must be constant across requests.
Also, what is the frequency of your API calls? If they are very infrequent, then depending on overall API load, your prefix may be evicted from the cache before the next request arrives.
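To make that concrete, here is a minimal sketch of how you'd structure the request so the static system prompt forms a constant, cacheable prefix. The function name and the placeholder prompt are my own; the key idea is just that the system prompt is byte-for-byte identical on every call and all dynamic content comes after it:

```python
# Keep the long, static system prompt as the very first message, identical
# on every request. OpenAI's automatic prompt caching only matches a
# constant prefix, and only once that prefix reaches ~1,024 tokens.
STATIC_SYSTEM_PROMPT = (
    "...your full ~1,500-token system prompt, unchanged between requests..."
)

def build_messages(user_input: str) -> list[dict]:
    """Build the message list so the cacheable prefix stays constant.

    Any dynamic content (dates, usernames, the user's request) goes AFTER
    the system prompt, so it never breaks the shared prefix.
    """
    return [
        # constant prefix -- eligible for caching
        {"role": "system", "content": STATIC_SYSTEM_PROMPT},
        # changing part goes last
        {"role": "user", "content": user_input},
    ]
```

You would then pass `build_messages(...)` to the chat completions endpoint as usual. If caching is working, the API response's usage details should report a nonzero count of cached prompt tokens on repeat calls, which you can monitor to confirm you're actually getting the discounted rate.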