How do I get token limit per MINUTES and when it will reset?

The request header is returning

'x-ratelimit-limit-requests': '5000',
    'x-ratelimit-limit-tokens': '5000000',
    'x-ratelimit-limit-tokens_usage_based': '5000000',
    'x-ratelimit-remaining-requests': '4999',
    'x-ratelimit-remaining-tokens': '43018',
    'x-ratelimit-remaining-tokens_usage_based': '13190',
    'x-ratelimit-reset-requests': '12ms',
    'x-ratelimit-reset-tokens': '23h47m36.648s',
    'x-ratelimit-reset-tokens_usage_based': '23h56m12.07s',

both reset-tokens and reset-tokens_usage_based is based on TPD limit not TPM limit.

The rate limit has a very peculiar formula, and the reset-tokens is seemingly when the memory state is back to the same as if you had made no requests. No OpenAI documentation lays out how to write code based on the values in headers or their meanings.

Remaining tokens seems as if you’ve been hitting the rate limit, perhaps even getting errors instead of fulfillment, so that may make the refill rate (that one would think is simply 5M tokens per minute) take longer to again be memoryless.

How that formula works: impossible proprietary mystery.

This can help - Squeezing Water from Stone: Managing OpenAI Rate Limits with Request Prioritization | FluxNinja Aperture