I’m creating an app using the API to teach users a foreign language, by having them chat with GPT.
I’m trying to replicate, to some extent, the ChatGPT web approach, so that the user feels like the AI remembers what they’ve been talking about.
I’m using GPT-3.5 to keep costs down and for response speed.
The limiting factor is that I want the app to be scalable, which means I’d like to keep the number of tokens used per interaction reasonable and not send the full history with every request.
So the standard approach is to:
- Include the previous interactions in the current request. I’ve done this before and am familiar with the process of using the openai Chat API.
- When reaching a certain threshold:
  - have GPT summarize the thread so far
  - feed GPT the summary (as part of the prompt?) and “start a new thread” (rough sketch below)
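Here’s a rough sketch of what I have in mind, assuming the official Python openai client and tiktoken for approximate token counting; the threshold value and the summarize_history helper are just placeholders I made up, not anything from the library:

```python
import tiktoken
from openai import OpenAI

client = OpenAI()
MODEL = "gpt-3.5-turbo"
SUMMARY_TOKEN_THRESHOLD = 1500  # illustrative threshold, to be tuned experimentally

enc = tiktoken.encoding_for_model(MODEL)


def count_tokens(messages):
    # Rough estimate: token count of the concatenated message contents.
    return sum(len(enc.encode(m["content"])) for m in messages)


def summarize_history(messages):
    # Ask the model to compress the conversation so far into a short summary.
    summary_request = messages + [{
        "role": "user",
        "content": ("Summarize our conversation so far in a few sentences, "
                    "keeping the learner's level, recent vocabulary, and topics."),
    }]
    resp = client.chat.completions.create(model=MODEL, messages=summary_request)
    return resp.choices[0].message.content


def chat_turn(history, system_prompt, user_message):
    # When the rolling history gets too large, replace it with a summary
    # and effectively "start a new thread" seeded by that summary.
    if count_tokens(history) > SUMMARY_TOKEN_THRESHOLD:
        summary = summarize_history(history)
        history = [{"role": "system",
                    "content": f"Summary of the conversation so far: {summary}"}]

    messages = ([{"role": "system", "content": system_prompt}]
                + history
                + [{"role": "user", "content": user_message}])
    resp = client.chat.completions.create(model=MODEL, messages=messages)
    reply = resp.choices[0].message.content

    history += [{"role": "user", "content": user_message},
                {"role": "assistant", "content": reply}]
    return reply, history
```

The open questions for me are where the summary should live (a system message as above, or folded into the main prompt) and how aggressive the threshold should be.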
i"m interested in hearing other folks experiences with this approach and experimenting with different number of interaction, summarization approach, and other insights.