Assistant Model Thread History

berrycoder.0 · February 17, 2025, 12:09am

Hi,
When adding messages to a thread and running the thread, does this process include everything in the thread history and feed it to the model? If yes, does this mean that all tokens (or maybe the ones not truncated) count towards the input token count?
I saw a few posts (it does not allow me to include links in my post for some reason) on how the assistant model’s API Pricing and token usage, but was never able to find an answer and those posts are closed.
Thank you!

Foxalabs · February 17, 2025, 12:25am

all of the prior messages are sent to the model each time, if you have stored files that are also used as context, those files can be used as well, again adding to the token count.

So, to clarify, the underlying AI is stateless. The assistants API manages your past conversations behind the scenes, but those past messages need to be sent to the model for every API call.

Topic		Replies	Views
Does the pricing for the Assistant API charge only for the latest message and its output, or does it also include the cost of the entire conversation history within a thread? API assistants-pricing	3	1796	October 23, 2024
Do assistants count messages in the thread against the tokens limit? API gpt-4	2	1772	December 17, 2023
Does Assistants API save cost for back-and-forth conversation? API assistants-api	4	2837	December 16, 2023
Max number of tokens a Thread can use equal the Context Length of the used model? API	3	915	December 1, 2023
When fiering runs is the thread tokens counted again every time? API assistants-api	7	214	January 29, 2025

Assistant Model Thread History

Related topics