Hello,
I’m a developer working on an API using the gpt-4o-mini-tts
model.
I have a question regarding the cost calculation for input and output tokens.
I understand that the output tokens are audio tokens, but I’m not sure how they are measured or calculated.
Specifically, can audio tokens be calculated using tools like tiktoken
, or is there a different method?
Thank you in advance for your help.