How to limit input tokens of assistant?