Return inference cost in cost_in_usd_ticks

fred.fischer1 · May 5, 2026, 8:01pm

I just noticed that xAI returns the actual billed inference cost in the response JSON:

"usage": {
"prompt_tokens": 151,
"completion_tokens": 4,
"total_tokens": 749,
"prompt_tokens_details": {
"text_tokens": 151,
"audio_tokens": 0,
"image_tokens": 0,
"cached_tokens": 128
},
"completion_tokens_details": {
"reasoning_tokens": 594,
"audio_tokens": 0,
"accepted_prediction_tokens": 0,
"rejected_prediction_tokens": 0
},
"num_sources_used": 0,
"cost_in_usd_ticks": 15493500
},

“cost_in_usd_ticks” is the actual inference cost in “ticks” (10^10 ticks per dollar).

I think this would be a useful value for all model providers to return. Right now I have to compute it myself from the returned token counts, but it would be better to get an authoritative answer directly from the provider especially given the complications with caching and internal tools like web searches and grounding.

Topic		Replies	Views
Feature request: Include cost of API call (cents) in API responses Feedback	0	104	October 4, 2024
Proposal: Introducing an API Endpoint for Token Count and Cost Estimation Feedback api	4	1641	September 22, 2024
About how to calculate API call costs more conveniently Feedback api	0	515	November 7, 2023
Feature request: Token usage in DALLE API responses API	3	1155	December 29, 2023
Display Cost in Fine Tunes API API	2	347	September 1, 2023

Return inference cost in cost_in_usd_ticks

Related topics