Hello,
I’m a developer working on an API using the gpt-4o-mini-tts model.
I have a question regarding the cost calculation for input and output tokens.
I understand that the output tokens are audio tokens, but I’m not sure how they are measured or calculated.
Specifically, can audio tokens be calculated using tools like tiktoken, or is there a different method?
Thank you in advance for your help.