I’m writing a function to display the cost of each API call using tiktoken, as a learning exercise. I learned about tokens per message: a fixed overhead added for each message that doesn’t depend on the length of the message content. For gpt-4o, this is 3 tokens.
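For context, here is roughly how I’m counting tokens (a minimal sketch based on the commonly cited num_tokens_from_messages recipe; the 3-token overhead values and the assumption that message fields are plain strings are mine, so this may not match OpenAI’s billing exactly):

```python
import tiktoken

def num_tokens_from_messages(messages, model="gpt-4o"):
    """Rough token count for a chat request, including per-message overhead."""
    try:
        encoding = tiktoken.encoding_for_model(model)
    except KeyError:
        # Older tiktoken releases may not know gpt-4o; o200k_base is its encoding.
        encoding = tiktoken.get_encoding("o200k_base")

    tokens_per_message = 3  # fixed overhead per message (my assumption for gpt-4o)
    num_tokens = 0
    for message in messages:
        num_tokens += tokens_per_message
        for value in message.values():
            # Assumes every field (role, content, name, ...) is a plain string.
            num_tokens += len(encoding.encode(value))
    num_tokens += 3  # the reply is primed with <|start|>assistant<|message|>
    return num_tokens

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "How many tokens is this?"},
]
print(num_tokens_from_messages(messages))
```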
What confuses me is how the totals are calculated: OpenAI adds 3 tokens of overhead per message, while other APIs such as Anthropic’s Claude 3.5 Sonnet count the actual content.
Given this, wouldn’t GPT-4o be significantly cheaper in most cases? Am I missing something in this comparison?
- GPT-4o: $5/million input tokens, $15/million output tokens
- Claude 3.5 Sonnet: $3/million tokens (flat rate)
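To make the comparison concrete, this is the arithmetic I have in mind, using hypothetical token counts and the rates I listed above (which may themselves be wrong, so treat this as a sketch of my reasoning, not a verified cost formula):

```python
# Hypothetical example: 1,000 input tokens and 500 output tokens per call,
# priced with the per-million-token rates listed above.
input_tokens, output_tokens = 1_000, 500

gpt4o_cost = input_tokens / 1e6 * 5 + output_tokens / 1e6 * 15
claude_cost = (input_tokens + output_tokens) / 1e6 * 3  # flat rate as I understand it

print(f"GPT-4o:            ${gpt4o_cost:.6f}")
print(f"Claude 3.5 Sonnet: ${claude_cost:.6f}")
```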