Unable to access ratelimit headers in Node SDK due to exception

lemonberrylabs · April 21, 2025, 8:33pm

I’m using the OpenAI Node.js SDK and have encountered an issue when handling rate limits. Specifically, when a 429 error occurs, an exception is thrown, and the exception object does not include rate limit headers such as x-ratelimit-reset-requests. This makes it challenging to implement proper retry logic based on the server’s suggested wait times.

This happens with responses.create, both with and without withResponse().

The exception object does contain some headers, but not the ratelimit headers, e.g.:

{
  "status": 429,
  "headers": {
    "alt-svc": "h3=\":443\"; ma=86400",
    "cf-cache-status": "DYNAMIC",
    "cf-ray": "[redacted]",
    "connection": "keep-alive",
    "content-length": "354",
    "content-type": "application/json",
    "date": "Mon, 21 Apr 2025 19:13:46 GMT",
    "openai-organization": "[redacted]",
    "openai-processing-ms": "5977",
    "openai-version": "2020-10-01",
    "server": "cloudflare",
    "set-cookie": "[redacted]",
    "strict-transport-security": "max-age=31536000; includeSubDomains; preload",
    "x-content-type-options": "nosniff",
    "x-request-id": "[redacted]"
  },
  "request_id": "[redacted]",
  "error": {
    "message": "Rate limit reached for gpt-4.1 in organization [redacted] on tokens per min (TPM): Limit 30000, Used 18271, Requested 16921. Please try again in 10.384s. Visit https://platform.openai.com/account/rate-limits to learn more.",
    "type": "tokens",
    "param": null,
    "code": "rate_limit_exceeded"
  },
  "code": "rate_limit_exceeded",
  "param": null,
  "type": "tokens"
}

Is there a recommended approach to access these headers when a 429 error is thrown? Alternatively, is there a way to prevent the SDK from throwing an exception so that I can inspect the full response, including headers?
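For context, here is a minimal sketch of the retry logic those headers would enable if they were reachable on the error. It assumes the caught error exposes `headers` either as a plain object (as in the dump above) or as a Fetch-style object with a `get` method, and that reset values use the duration format shown in OpenAI's rate limit docs (for example `1s` or `6m0s`). The helper names are hypothetical and not part of the SDK, and `x-ratelimit-reset-tokens` is assumed to be the token-limit counterpart of `x-ratelimit-reset-requests`.

```typescript
// Hypothetical helpers; none of these names come from the OpenAI SDK.
type HeaderBag =
  | Record<string, string | null | undefined>
  | { get(name: string): string | null };

// Look up a header case-insensitively in either shape.
function getHeader(headers: HeaderBag, name: string): string | undefined {
  const maybeGet = (headers as { get?: unknown }).get;
  if (typeof maybeGet === "function") {
    const value = (headers as { get(n: string): string | null }).get(name);
    return value === null ? undefined : value;
  }
  const bag = headers as Record<string, string | null | undefined>;
  const wanted = name.toLowerCase();
  for (const key of Object.keys(bag)) {
    const value = bag[key];
    if (key.toLowerCase() === wanted && typeof value === "string") return value;
  }
  return undefined;
}

// Parse durations such as "1s", "20ms", or "6m0s" into milliseconds.
// Assumption: this is the format the reset headers use.
function parseResetDuration(value: string): number | undefined {
  const unitMs: Record<string, number> = { h: 3600000, m: 60000, s: 1000, ms: 1 };
  const part = /(\d+(?:\.\d+)?)(ms|h|m|s)/g;
  const text = value.trim();
  let total = 0;
  let consumed = 0;
  let match: RegExpExecArray | null;
  while ((match = part.exec(text)) !== null) {
    if (match.index !== consumed) return undefined; // junk between parts
    const [whole, amount = "0", unit = "ms"] = match;
    total += Number(amount) * (unitMs[unit] ?? 0);
    consumed += whole.length;
  }
  return consumed > 0 && consumed === text.length ? Math.round(total) : undefined;
}

// Return the longest reset delay found, or undefined if no reset header exists.
function retryDelayMsFromHeaders(headers: HeaderBag): number | undefined {
  const names = ["x-ratelimit-reset-requests", "x-ratelimit-reset-tokens"];
  let delay: number | undefined;
  for (const name of names) {
    const raw = getHeader(headers, name);
    const ms = raw === undefined ? undefined : parseResetDuration(raw);
    if (ms !== undefined && (delay === undefined || ms > delay)) delay = ms;
  }
  return delay;
}
```

In a catch block this would be called with the caught error's `headers` property. With the headers shown in the dump above it returns `undefined`, which is exactly the problem: there is nothing to base the wait on. Taking the larger of the two reset values is a conservative choice, not something the API prescribes.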

Right now my best-effort workaround is to parse the error message, which looks like this:

"Rate limit reached for gpt-4.1 in organization [redacted] on tokens per min (TPM): Limit 30000, Used 18271, Requested 16921. Please try again in 10.384s. Visit https://platform.openai.com/account/rate-limits to learn more"

This is obviously bad practice, since it depends on the exact wording of the message. Any guidance on how to handle this scenario effectively would be appreciated.
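For reference, the message-parsing workaround can be sketched roughly as follows. The function name is hypothetical, and the regular expression assumes the message always contains "try again in" followed by a number and a unit of `s` or `ms`. Only the seconds form appears in the message quoted above, so the `ms` branch is a guess.

```typescript
// Hypothetical fallback: extract the suggested wait time from the error message.
// Brittle by design, because it depends on the exact wording of the message.
function retryDelayMsFromMessage(message: string): number | undefined {
  const match = /try again in (\d+(?:\.\d+)?)(ms|s)\b/i.exec(message);
  if (match === null) return undefined;
  const [, amountText = "0", unit = "s"] = match;
  const amount = Number(amountText);
  // Normalise to whole milliseconds; rounding hides floating-point noise.
  return Math.round(unit.toLowerCase() === "ms" ? amount : amount * 1000);
}

const sample =
  "Rate limit reached for gpt-4.1 in organization [redacted] on tokens per min (TPM): " +
  "Limit 30000, Used 18271, Requested 16921. Please try again in 10.384s. " +
  "Visit https://platform.openai.com/account/rate-limits to learn more.";

const delayMs = retryDelayMsFromMessage(sample); // → 10384
```

If the message format ever changes, the function returns `undefined`, so a caller would still need a default backoff.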
