I get that Assistant will manage context for me and all that and therefore the number of tokens used will vary … BUT I need to know how much this is costing? Currently I’m not using retrieval which has semi-clear cost.
I don’t know if a typical chat session costs $0.05 or $5.00! Kind of important!
I’ll assuming:
client.beta.assistants.create is FREE
client.beta.threads.create is FREE
client.beta,threads.messages.create costs SOMETHING
client.beta,threads.runs.create is FREE
client.beta.threads.runs.retrieve is FREE
client.beta.threads.threads.messages.list cost SOMETHING
Why doesn’t the API just return token count when appropriate??
Thanks for any info.