egils:
Scenario: I have a chatbot with memory built on GPT-4.
Each time user asks new question, I get context from embeddings for this particular question and inject into request.
Example
Q1: Who is CEO?
[from embeddings is retrieved about-us page content and added to context]
A1: CEO is John
Q2: Since when?
[from embeddings gets retrieved absolute irrelevant context for sentence “Since when?” and passed for completion]
A2: generic none-sense
The issue is that follow-up question may not be context aware of what conversation was about so far.
What is recommended approach to provide context to follow-up question in order to retrieve relevant context for completion?
Your issue is simple. You need to create a standalone question to submit on your subsequent queries. I made a fairly rambling video about it here: https://youtu.be/B5B4fF95J9s
This is how it was explained to me: Chat Completion Architechture - #7 by SomebodySysop
2 Likes