Why Do Vision Models Count Correctly in UI But Not Via API?

I’m experiencing a persistent issue with object counting using OpenAI vision models:

The problem: When analyzing the exact same image with 28 coins:

  • ChatGPT UI (o4-mini, o3, GPT-4o): Consistently counts 28 coins correctly
  • API/Playground (o4-mini, o3, GPT-4.1): Always returns incorrect counts (25, 30, 35)

I’ve extensively tested various parameters in API calls:

  • Different temperature values (0-0.5)
  • All reasoning_effort settings
  • Adjusted max_tokens (10-4000)
  • Various prompting strategies
  • Stripped down system prompts to bare minimum

Despite identical images and near-identical prompts, the UI consistently succeeds where the API fails. Our backend uses openaiService.js with a standard system prompt that we’ve progressively simplified.
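For reference, here is a minimal sketch of the kind of call involved, using the official openai Node SDK. The model name, image URL, prompt text, and system prompt are placeholders, not the exact contents of openaiService.js:

```javascript
// Minimal sketch of the vision counting call (placeholders, not the actual production code).
// Assumes the official "openai" npm package and OPENAI_API_KEY set in the environment.
import OpenAI from "openai";

const client = new OpenAI();

async function countCoins(imageUrl) {
  const response = await client.chat.completions.create({
    model: "gpt-4.1",   // o4-mini / o3 were also tested (with reasoning_effort instead of temperature)
    temperature: 0,     // values from 0 to 0.5 were tried
    max_tokens: 4000,   // values from 10 to 4000 were tried
    messages: [
      // Stripped-down system prompt
      { role: "system", content: "You are a careful visual counter." },
      {
        role: "user",
        content: [
          { type: "text", text: "Count the coins in this image. Reply with the number only." },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  });
  return response.choices[0].message.content;
}
```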

Has anyone else encountered this discrepancy between UI and API for vision counting tasks? Are there hidden UI parameters or different model versions being served?

That’s very interesting.

My guess is that the ChatGPT UI is applying some kind of middleware processing, as it often does. Counting is a known weak spot for LLMs, so presumably they haven’t solved it in the model itself yet and are using some middleware step or additional tool call/reasoning within the web app to compensate?
