Batch file documentation error - and many more documentation issues added

You will find that a Python execution environment doesn’t appreciate JavaScript’s “const” very much…


What I had to find out by doing (and then waiting 24 hours) instead of by reading it anywhere it is offered: the Files endpoint purpose of a batch results output file.

Documentation about the purpose of batch results files, fine-tune results files, or any central listing of all purposes and their upload or download capability is missing.
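
A minimal sketch of the check I ended up doing, assuming an already-completed batch (the batch ID is a placeholder):

    from openai import OpenAI

    client = OpenAI()

    # Retrieve a completed batch, then look at what `purpose` the API
    # actually assigned to its output file; the value isn't documented anywhere.
    batch = client.batches.retrieve("batch_abc123")  # placeholder ID
    output_file = client.files.retrieve(batch.output_file_id)
    print(output_file.purpose)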

…this answer, btw:
(screenshot)


Some of these docs would be good with a “get notebook” cookbook link that could run everything discussed. Perhaps even for batches, a code block that sits in a 10-minute polling loop, like the sketch below.
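
Something along these lines would do, assuming the Python SDK's batches.retrieve and the terminal statuses listed in the batch guide:

    import time

    from openai import OpenAI

    client = OpenAI()
    batch_id = "batch_abc123"  # placeholder ID

    # Check every 10 minutes until the batch reaches a terminal status
    while True:
        batch = client.batches.retrieve(batch_id)
        print(batch.status)
        if batch.status in ("completed", "failed", "expired", "cancelled"):
            break
        time.sleep(600)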


Since this has not been fixed, I bump with another documentation issue here:

https://help.openai.com/en/articles/5072518-controlling-the-length-of-openai-model-responses

It states: For reasoning models like o3, o4-mini, and gpt-4.1, use max_completion_tokens.

However, gpt-4.1 is not a reasoning model, and max_tokens works fine on chat completions. Truncated where expected:

prompt> are you a reasoning model?
assistant> I am a reasoning-aware language model designed to understand and generate human-like text. I can assist with reasoning tasks, problem-solving
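
The call behind that, roughly; the prompt and token limit are just what I happened to use:

    from openai import OpenAI

    client = OpenAI()

    # max_tokens is accepted by gpt-4.1 on Chat Completions and truncates as expected
    response = client.chat.completions.create(
        model="gpt-4.1",
        messages=[{"role": "user", "content": "are you a reasoning model?"}],
        max_tokens=30,
    )
    print(response.choices[0].message.content)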

Additionally, that page needs to mention that max_completion_tokens does not really address the topic title, “Controlling the length of OpenAI model responses”; rather, it sets the maximum cost of output, including unseen reasoning, and must be set much higher, which calls for a separate usage section there specifically for reasoning AI models.

Then it should note prediction tokens as an output cost, and the actual effect of sending more unmatched prediction tokens than your max_completion_tokens allows.

Here is even more inconsistency with the facts.

https://help.openai.com/en/articles/10910291-api-organization-verification

The validation page says:

Once verified, your organization will be able to use o3 with Streaming responses and have access to our GPT Image model.

Additionally, you will be able to access Reasoning Summaries when using the Responses API for o1, o3-mini, o3 and o4-mini.

Some organizations may already have access to these models and capabilities without having to go through the Verification process. To check if your organization has access, view the Limits page or test in the Playground.

“To check if your organization has access”, it says, look here:

We determine that to be a falsehood if you refuse to submit to this overreach.

The model name is still shown in Limits, so contrary to the document, that page cannot show whether verification has released a model for use.

Is it the missing RPM alone that is an indicator? How about this org with RPM then? Nope.

There is also no indication other than your failing API requests that you are denied reasoning summaries.

Even more documentation errors. The Assistants API Reference is missing the object contents for file_search:

Which should be:

    tool_resources={
        "file_search": {
            "vector_store_ids": [vector_store_id]
        }
    },
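
In context, a sketch of the full call that block belongs to (the model name and vector store ID are placeholders):

    from openai import OpenAI

    client = OpenAI()
    vector_store_id = "vs_abc123"  # placeholder ID

    assistant = client.beta.assistants.create(
        model="gpt-4o",
        tools=[{"type": "file_search"}],
        tool_resources={
            "file_search": {
                "vector_store_ids": [vector_store_id]
            }
        },
    )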

Just more shouting into the abyss here.

API Reference → Responses → Delete response ID

It shows an invalid method .del()

I didn’t think this reserved keyword could actually exist in the Python library as a method without failing with syntax errors:

So, in fact, we go to openai/resources/responses/responses.py to see the correct usage:

    def delete(
        self,
        response_id: str,
        *,
        # (per-request options elided)
        timeout: float | httpx.Timeout | None | NotGiven = NOT_GIVEN,
    ) -> None:
        """
        Deletes a model response with the given ID.
        """

For the create image edit endpoint, API Reference shows this parameter:

(screenshot: the documented “quality” parameter)

However, it simply cannot be passed to the API:

Trying “quality”: “standard” against dall-e-2 and the Python SDK library:

        if model == "gpt-image-1":
            image_params.pop("response_format", None)  # gpt-image-1 does not take response_format
        elif model == "dall-e-2":
            image_params.pop("quality", None)          # strip values dall-e-2 normally rejects
            image_params.pop("background", None)
            image_params["quality"] = "standard"       # then re-add the documented default to test it
        else:
            raise ValueError("edit_image received unknown model")
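
For context, a sketch of the call that fragment feeds, assuming an SDK version that exposes the documented quality parameter; the input file and prompt are placeholders:

    from openai import OpenAI

    client = OpenAI()

    # End state of the dall-e-2 branch above: the documented default left in place
    image_params = {"quality": "standard"}

    result = client.images.edit(
        model="dall-e-2",
        image=open("input.png", "rb"),      # placeholder input file
        prompt="make the background blue",  # placeholder prompt
        **image_params,
    )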

We observe the documentation to present a falsehood.

Others have reported the same on gpt-image-1: no support for the low, medium, or high quality values that the documentation also presents.


The search API has no Python documentation, and the format/type that the discovered SDK method returns is quite different from the underlying JSON.

Showing here: My metadata isn't correctly processed when uploading a JSON file - #2 by _j

Multiple misspellings of “guarantee” in API reference for chat completions.

I can just keep piling on things done poorly; this one is for ChatGPT.

GPT-4.5 is getting a nonsensical message about dalle in its system message, a tool which doesn’t exist for it, or for anything else other than the DALL-E GPT:

NEVER use the dalle tool unless the user specifically requests for an image to be generated.

and: annoying “helpful follow-up” solicitations. They waste input and output tokens. (That is where I asked for that system message reproduction, to counter this.)

More bad documentation to report. This is about API vision. https://platform.openai.com/docs/guides/images-vision?api-mode=chat

These requirements just seem bizarrely out of touch. Not only are larger images allowed (though they face resizing by the API), but certainly text and logos can be sent (or: put it in legal).

I can prove that immediately wrong, as do the examples later on the same page that refer to sending images like 4096 x 8192.

I arrived at this by trying in earnest to make a pricing calculator that aligns with the documentation, something only possible through a lot of experimentation and prior issue reports.

Tiles models

Sending an image 32,000px by 32px to gpt-4.1 (alternating 32x32 black and white):
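
How such a strip can be generated, a sketch with PIL; my original test image followed this pattern, details from memory:

    from PIL import Image

    # 32,000 x 32 px strip of alternating 32x32 black and white squares
    width, height, square = 32000, 32, 32
    img = Image.new("L", (width, height), 255)  # start all white
    for x in range(0, width, square):
        if (x // square) % 2 == 0:
            img.paste(0, (x, 0, x + square, height))  # fill alternate squares black
    img.save("checkerboard_32000x32.png")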

The AI can see the patterns of what would become 2048x2 if you follow the formula page for resizing.

gpt-4.1 gives 779 input tokens reported by the API for 32000 x 32, aligning with calculations.
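
The calculation in question, following the documented tile procedure and the gpt-4o-class 85-base / 170-per-tile figures; the remainder of the 779 would be the text of the request:

    import math

    def tile_image_tokens(width: int, height: int,
                          base: int = 85, per_tile: int = 170) -> int:
        """Image token cost for tile-based vision models, per the documented steps."""
        # 1. Scale to fit within a 2048 x 2048 square
        scale = min(1.0, 2048 / max(width, height))
        w, h = width * scale, height * scale
        # 2. Scale so the shortest side is at most 768 px
        scale = min(1.0, 768 / min(w, h))
        w, h = w * scale, h * scale
        # 3. Count the 512 px tiles needed to cover the result
        tiles = math.ceil(w / 512) * math.ceil(h / 512)
        return base + per_tile * tiles

    print(tile_image_tokens(32000, 32))  # 32000x32 -> 2048x2 -> 4 tiles -> 765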


Improper resize on tiled images, or really 2000 pixel limit?

I’m going to send this image. 8 tiles are used because 513 is over the 512px tile size.

Any resizing from 2048px down to 2000px would also affect that smaller dimension, making it one tile high instead of two, but that doesn’t happen:

Note that we had to live for two weeks with bad documentation saying gpt-4.1 was “patches”.

Patches models

These are models where each 32x32 pixel grid square is one token, with resizing applied until reaching a maximum of 1536 tokens.
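
Before any per-model multiplier, the patch arithmetic alone is just this; a sketch of the 32px grid count, not a claim about the undocumented resizing:

    import math

    def patch_count(width: int, height: int, cap: int = 1536) -> int:
        """Naive 32px-patch count for patch-based vision models, capped at 1536."""
        patches = math.ceil(width / 32) * math.ceil(height / 32)
        return min(patches, cap)

    print(patch_count(32000, 32))  # 1000 patches: already under the 1536 cap, no resize needed
    print(patch_count(2000, 32))   # 63 patches: what a 2000px long-side clamp would produce

If the long side really is being clamped to 2000px first, that is where the 63-token figure discussed below comes from, even though the unresized image would have fit within the 1536 budget.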

gpt-4.1-mini, however, is dumbfounded by the same image that was previously described correctly; complete hallucination:

And then the tokens billed, or any size restriction, don’t align with anything in the pricing formulation section OpenAI provides, which doesn’t actually have a formula or algorithm, just guesswork for you.

One word, one 51200x32 image, similar billing:

We extrapolate back, and it may be only the “patches” models that have a 2000px size limit placed on them, even though the model could take 1536 tokens of vision sent in different dimensions instead of a 63-token cap. If tiles models were similarly affected by resizes between 2000 and 2048, that would break the later documentation on the same page.

Conclusion

  • Input size as a limit is wrong
  • Patches resizing is not documented
  • Tiles-model images are not even resized to the stated “limit”
  • 2000 pixels is a bad limit: it is 62.5 of the 32px patches, when 64 patches (2048px) would divide evenly. Or images could simply not be limited in dimension like that, letting the documented algorithms keep whatever is sent under the 1536-token budget.
  • Documentation needs to be rewritten, just like everything else noted here.

Appendix

gpt-4.1-mini — that boy just ain’t right

This shows it isn’t just a fluke of skinny images being resized to nothing: the gpt-4.1-mini model just cannot see the checkerboard at all, even though we can plainly see it in the Prompts Playground preview.

Another documentation issue, this time the database powering https://platform.openai.com/docs/models/gpt-4-turbo

No image input for “gpt-4-1106-vision-preview”??

Does this old friend that accepts full images at an 85-token base still have people with access, just not us?

Error: Error code: 404 - {'error': {'message': 'The model gpt-4-1106-vision-preview has been deprecated, learn more here: https://platform.openai.com/docs/deprecations', 'type': 'invalid_request_error', 'param': None, 'code': 'model_not_found'}}

Images endpoint API reference

images.generate Python example shows writing bytes for gpt-image-1.

However, there is no response_format object example, and no code demonstration for URL image return type.

“pay for several images to figure this out?”
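
Something like this is all the URL-return documentation would take; dall-e-3 shown here, since gpt-image-1 only returns base64 (the prompt is a placeholder):

    from openai import OpenAI

    client = OpenAI()

    # dall-e-2 / dall-e-3 can return a hosted URL instead of base64 image data
    result = client.images.generate(
        model="dall-e-3",
        prompt="a watercolor lighthouse at dusk",  # placeholder prompt
        response_format="url",
        n=1,
    )
    print(result.data[0].url)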

The Images examples and responses could use a model selector like Chat Completions has.

Is anything at all fixed, May 14, 16 days later?

  1. Batch docs Python example uses javascript const: no (https://platform.openai.com/docs/guides/batch#4-check-the-status-of-a-batch)
  2. gpt-4.1 is a reasoning model?: no (https://help.openai.com/en/articles/5072518-controlling-the-length-of-openai-model-responses)
  3. check your validation model access via org’s limits: no (https://help.openai.com/en/articles/10910291-api-organization-verification), (https://platform.openai.com/settings/organization/limits)
  4. tool_resources → file_search API parameters missing: no (https://platform.openai.com/docs/api-reference/assistants/createAssistant)
  5. API ref, Responses “delete” SDK method example wrong: no (https://platform.openai.com/docs/api-reference/responses/delete?lang=python)
  6. “standard” quality rejected by dall-e-2 API: no (but reformatted) (https://platform.openai.com/docs/api-reference/images)
  7. Chat Completions & Responses “service tier” misspellings: no (https://platform.openai.com/docs/api-reference/responses/create)
  8. Vision “size limits” wrong in several ways: no (https://platform.openai.com/docs/guides/images-vision?api-mode=chat)
  9. Shutoff gpt-4-1106-vision-preview in models docs: no (https://platform.openai.com/docs/models/gpt-4-turbo)

OpenAI yaml: April 29.
