I’m using the OpenAI API to build a custom chat system and have a few ideas for handling conversation history:
- Send the entire message history to OpenAI and rely on prompt caching for optimization.
- Truncate the middle of the conversation, keeping only the first two and last two messages.
- Update user expectations based on the latest response.
- Use a mini RAG system to manage the context of the truncated middle messages.
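To make ideas 2 and 4 concrete, here's a rough sketch of what I have in mind: keep the first and last messages verbatim, and pull back the most relevant truncated-middle messages for the current query. The bag-of-words cosine similarity is just a stand-in for real embeddings, and all the names (`build_context`, `head`, `tail`, `k`) are my own placeholders, not anything from the API:

```python
from collections import Counter
import math

def _cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity over token counts -- a toy proxy for embeddings.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def build_context(messages, query, head=2, tail=2, k=1):
    """Keep the first `head` and last `tail` messages; from the truncated
    middle, recall the `k` messages most similar to `query`."""
    if len(messages) <= head + tail:
        return list(messages)
    middle = messages[head:len(messages) - tail]
    qv = Counter(query.lower().split())
    scored = sorted(
        middle,
        key=lambda m: _cosine(Counter(m["content"].lower().split()), qv),
        reverse=True,
    )
    note = {"role": "system",
            "content": "[earlier messages omitted; most relevant recalled below]"}
    return messages[:head] + [note] + scored[:k] + messages[-tail:]

history = [
    {"role": "user", "content": "hi"},
    {"role": "assistant", "content": "hello, how can I help?"},
    {"role": "user", "content": "my order number is 12345"},
    {"role": "assistant", "content": "thanks, noted your order"},
    {"role": "user", "content": "also what's the weather like"},
    {"role": "assistant", "content": "sunny today"},
    {"role": "user", "content": "what was my order number again?"},
]
context = build_context(history, query=history[-1]["content"])
```

Here `context` would contain the first two messages, a system note, the recalled order-number message, and the last two messages, which is what would actually get sent to the API instead of the full history.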
Any other ideas?
Would these approaches be effective, or are there better ways to handle context efficiently?