API Dashboard Shows More Requests Than Sent via Python SDK

_j · December 13, 2025, 11:44pm

The OpenAI SDK automatically retries a request that times out. A timeout doesn’t mean that the generation didn’t happen; it means you didn’t receive the result, and each retry reaches the API as another request.

As one might read in the SDK’s readme.md:

Retries

Certain errors are automatically retried 2 times by default, with a short exponential backoff.
Connection errors (for example, due to a network connectivity problem), 408 Request Timeout, 409 Conflict,
429 Rate Limit, and >=500 Internal errors are all retried by default.

You can use the max_retries option to configure or disable retry settings:

from openai import OpenAI

# Configure the default for all requests:
client = OpenAI(
    # default is 2
    max_retries=0,
)

# Or, configure per-request:
client.with_options(max_retries=5).chat.completions.create(
    messages=[
        {
            "role": "user",
            "content": "How can I get the name of the current day in JavaScript?",
        }
    ],
    model="gpt-4o",
)

Thus, “more than sent” can actually mean “exactly the number that were sent, which is more than the number your code saw”.
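
To make the counting concrete, here is a minimal simulation of that mismatch. It does not use the OpenAI SDK or reproduce its internals: `FakeServer`, `SimulatedTimeout`, and `create_with_retries` are hypothetical names invented for this sketch, and the backoff delay between attempts is left out. It only assumes the behavior quoted above, that a timed-out request is re-sent up to `max_retries` times.

```python
from dataclasses import dataclass


class SimulatedTimeout(Exception):
    """Raised when the client gives up waiting for a response."""


@dataclass
class FakeServer:
    """Hypothetical stand-in for the API: it counts every request it receives."""
    slow_responses: int       # how many initial requests answer too slowly
    requests_logged: int = 0  # what a usage dashboard would count

    def handle(self, prompt: str) -> str:
        self.requests_logged += 1  # the request arrived and is counted
        if self.requests_logged <= self.slow_responses:
            # The work was done, but the client stopped waiting for it.
            raise SimulatedTimeout(prompt)
        return f"response to {prompt!r}"


def create_with_retries(server: FakeServer, prompt: str, max_retries: int = 2) -> str:
    """One application-level call; up to 1 + max_retries requests are sent."""
    for attempt in range(max_retries + 1):
        try:
            return server.handle(prompt)
        except SimulatedTimeout:
            if attempt == max_retries:
                raise  # retries exhausted, surface the error to the caller
    raise AssertionError("unreachable")


server = FakeServer(slow_responses=2)
calls_made = 1  # the application called create once
result = create_with_retries(server, "hello", max_retries=2)
print(calls_made, server.requests_logged)  # prints: 1 3
```

With `max_retries=0` the same scenario logs a single request and raises the timeout to the caller, so the application can decide for itself whether a re-send is worth it.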
