Assistant API response messages + token count

sbin · December 13, 2023, 12:47am

In assistant APIs:

Are the API responses or bot messages auto appended to the threads or we have to do it?
How do I get the input and output token count for these API responses like I used to get in completions?

Thanks.

jmportilla · December 13, 2023, 1:31am

Assistant responses should automatically appear in that thread, you can confirm this with:
messages = client.beta.threads.messages.list(thread.id)
I believe the ThreadMessage will have the text value, but not sure it has a token attribute. Maybe check the run or thread object?

sbin · December 13, 2023, 1:31pm

Thank you for that.

OpenAI mentioned that they are planning on adding the token count feature soon

rheinze08 · December 13, 2023, 2:30pm

I don’t think it’s always as simple as “get the last message from the thread”. The messages may appear in sequence. For example, one run can have 2 back-to-back messages from the API if asking it to leverage Code Interpreter to do some data operations on a file. You may have to be operating/checking on the Run itself.

jmportilla · December 13, 2023, 5:04pm

That’s a good point, in which case you can check out the Run Steps:
https://platform.openai.com/docs/api-reference/runs/getRunStep

EricGT · December 17, 2023, 11:49am

As this topic has an accepted solution, closing topic.

Topic		Replies	Views
Do assistants count messages in the thread against the tokens limit? API gpt-4	3	1638	December 17, 2023
Open AI Assistants : how to get the token count? API api , assistants-api , assistants-pricing	16	13613	July 23, 2024
Assistants API - how to get token usage back from messages? API	5	2389	December 14, 2023
How to count TOKEN in a thread with API API assistants-api	3	184	October 26, 2024
Do Assistant-called function outputs count towards input tokens? API	8	1897	January 12, 2024

Assistant API response messages + token count

Related topics