The Elephant in the Room: Why No Persistent Conversational Memory in LLMs?

This one is already here: it’s called RAG, as you already know. It’s becoming more long-term in ChatGPT, and it uses more tokens as context, but it is not a new technology. OpenAI seems willing to add more history to ChatGPT (via brute force or RAG) to add value, so I think this is a given. But it does involve more infrastructure, and obviously more input tokens, which cost more to run. How fast this gets adopted will depend on the RAG/history infrastructure costs plus the additional compute costs; the lower those are, the faster.
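
To make the token-cost point concrete, here’s a minimal sketch of what RAG-style conversational memory boils down to. The bag-of-words embedding is a toy stand-in for a real embedding model, and `ConversationMemory` is just an illustrative name, not anyone’s actual API:

```python
# Minimal sketch of RAG-style conversational memory.
# The embedding here is a toy word-count vector, NOT a real embedding model.
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy embedding: bag-of-words counts.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class ConversationMemory:
    def __init__(self):
        self.turns = []  # (text, embedding) pairs from past conversations

    def store(self, text: str):
        self.turns.append((text, embed(text)))

    def recall(self, query: str, k: int = 3):
        q = embed(query)
        ranked = sorted(self.turns, key=lambda t: cosine(q, t[1]), reverse=True)
        return [text for text, _ in ranked[:k]]

memory = ConversationMemory()
memory.store("User prefers concise answers with code examples.")
memory.store("User is building a Rust CLI tool.")
memory.store("User asked about LoRA fine-tuning last week.")

# The recalled snippets get prepended to the prompt as extra context,
# which is exactly where the added input-token cost comes from.
print("\n".join(memory.recall("how do I fine-tune a small model?")))
```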

I think there was also a mention, or a hint, of a non-RAG way of doing it that involves only compute, without a bunch of DB stuff: a separate front-end “preference model” tuned to each user. These small models could be retrained regularly to adapt to the user’s preferences and past history, and could even form “memories” of past interactions that would influence the discussion.
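
Here’s a rough sketch of what such a per-user model might look like, assuming it’s trained as a small scorer over response embeddings. `PreferenceModel` and `nightly_update` are hypothetical names, and the tiny architecture is purely illustrative:

```python
# Hedged sketch of a per-user "preference model": a tiny network retrained
# periodically on the user's interaction history. All names/shapes are
# illustrative assumptions, not a real product design.
import torch
import torch.nn as nn

class PreferenceModel(nn.Module):
    def __init__(self, dim: int = 64):
        super().__init__()
        # Scores how well a candidate response matches this user's taste.
        self.net = nn.Sequential(nn.Linear(dim, 32), nn.ReLU(), nn.Linear(32, 1))

    def forward(self, response_embedding: torch.Tensor) -> torch.Tensor:
        return self.net(response_embedding)

def nightly_update(model: PreferenceModel, history) -> None:
    # history: (embedding, rating) pairs mined from past interactions.
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    loss_fn = nn.MSELoss()
    for emb, rating in history:
        opt.zero_grad()
        loss = loss_fn(model(emb), rating)
        loss.backward()
        opt.step()

model = PreferenceModel()
# Fake history just so the sketch runs end to end.
fake_history = [(torch.randn(64), torch.tensor([1.0])) for _ in range(8)]
nightly_update(model, fake_history)
```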

What’s cool about this is that you could export the weights of this model to another vendor, model, or system and resume your preferences and memories there, assuming such models get standardized and become easily portable. All without big DB transfers, which would require some sort of ETL unique to each DB, plus embedding costs and overhead, since nobody uses the same embedding model.
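
Continuing the sketch above, the portability story could then be as simple as exporting and importing a weights file, assuming the architecture ever gets standardized across vendors (which it isn’t today):

```python
# Continuing the PreferenceModel sketch above: export the weights, then
# reload them on a different system that implements the same (hypothetical)
# standardized architecture. No DB dump, no re-embedding, just a file.
import torch

torch.save(model.state_dict(), "preference_model.pt")

# Elsewhere, possibly on another vendor's stack:
fresh = PreferenceModel()
fresh.load_state_dict(torch.load("preference_model.pt"))
```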

If anything, you should start training your own preference model and using it to compactly store information about yourself over time. Then use it in conjunction with RAG to “oversee” the entire generation that the LLM is producing.
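
One simple reading of “oversee” is reranking: sample several candidates from the LLM and let the preference model pick the one that best matches your stored taste. Everything here (the scorer, the embedding, the candidate generator) is a toy stand-in, self-contained so the sketch runs on its own:

```python
# Sketch of "overseeing" generation via preference-model reranking.
# All components are toy stand-ins for a real LLM client and embedder.
import torch
import torch.nn as nn

# Stand-in for the per-user preference scorer from the earlier sketch.
scorer = nn.Sequential(nn.Linear(64, 32), nn.ReLU(), nn.Linear(32, 1))

def embed_response(text: str) -> torch.Tensor:
    # Toy embedding: an arbitrary fixed-size vector derived from the text.
    g = torch.Generator().manual_seed(hash(text) % (2**31))
    return torch.randn(64, generator=g)

def generate_candidates(prompt: str, n: int = 4) -> list[str]:
    # Stand-in for a real LLM call that samples n completions.
    return [f"{prompt} (candidate {i})" for i in range(n)]

def oversee(prompt: str, n: int = 4) -> str:
    # Rerank: return the candidate the preference model scores highest.
    candidates = generate_candidates(prompt, n=n)
    scores = [scorer(embed_response(c)).item() for c in candidates]
    return candidates[scores.index(max(scores))]

print(oversee("explain RAG briefly"))
```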

:thinking:
