all of the prior messages are sent to the model each time, if you have stored files that are also used as context, those files can be used as well, again adding to the token count.
So, to clarify, the underlying AI is stateless. The assistants API manages your past conversations behind the scenes, but those past messages need to be sent to the model for every API call.