When we use previous_response_id
, I am wondering if there is any way to manage the context window. For example, I want the entire conversation to use just 10K tokens so it can truncate itself.
I remember the Assistant API had something like a last_messages
to specify number of last message it would include.