Hello everyone,
I have a question regarding the development of my application using the GPTAPI. I’m aiming to allow users to have extended conversations with the chatbot. However, I understand that the API has a token limit that includes not only the latest user and bot messages, but the entire conversation history, as we need to send previous messages to the API every time.
So, what should I do when the conversation becomes too long? Should I omit the initial messages? If so, how can I do this effectively?
Additionally, when I use the chatGPT on the OpenAI website, I’ve noticed that even in long conversations that surely exceed the 4000 token limit, the chatbot can still reference topics from the beginning of the conversation. I’m struggling to understand how this aligns with the context window limit.
Any insights or advice would be greatly appreciated. Thank you!