How to calculate cost of Assistant call

peterfromm · December 3, 2024, 8:16am

I am trying to figure out what a call to my assistant costs. From the completed event, I can get the usage:

Usage: {
  prompt_tokens: 1661,
  completion_tokens: 37,
  total_tokens: 1698,
  prompt_token_details: { cached_tokens: 0 }
}

I am using model gpt-4o-mini-2024-07-18 right now, which has the following cost according to the pricing page:

$0.150 / 1M input tokens
$0.075 / 1M cached** input tokens
$0.600 / 1M output tokens

I am confused by the difference in terminology. Are prompt tokens input tokens? Are completion tokens output tokens? Or is there additional math involved in getting the price of the call?

ALBERT_MARTIN · May 7, 2025, 12:48pm

i have the same doubt on it.
the prompt_token is seems to be the total token like the input ,the instruction and files.
completion token is the response.
no idea even the cache is working. They said it is auto.

Topic		Replies	Views
Pricing model Open AI Assistants API - Caching tokens API assistants-api	0	224	December 3, 2024
How to correct compute the cost of an o1 model API call? API	1	347	January 23, 2025
How do I calculate the usage cost when using the GPT-4o-mini-TTS model? API assistants-api	5	1225	May 18, 2025
Cost of Assistant API with GPT-4-1106-preview API gpt-4 , api	0	1085	January 23, 2024
Prompt tokens in response do not match with dashboard input tokens statistics API assistants-api	0	140	February 18, 2025

How to calculate cost of Assistant call

Related topics