GPT-4 Repeatedly Failing at Basic Image Analysis I Used to Rely On

I’m writing to express my extreme frustration and disappointment with the current behavior of ChatGPT, specifically regarding image analysis. I’ve used this tool extensively in the past, and one of the key features I relied on was its ability to accurately interpret GIMP screenshots and extract pixel-perfect coordinates from clearly defined rectangle selections. That used to work flawlessly.

Now? It doesn’t.

Over the past several attempts, I have sent multiple GIMP screenshots — each showing a crystal-clear rectangular selection — to extract coordinates for FFmpeg’s delogo filter. Despite the images clearly showing the selection and being aligned perfectly with the logo I’m trying to remove, GPT repeatedly returned:

  • Incorrect coordinates
  • Coordinates outside the video frame bounds
  • Or even delogo boxes that miss the mark entirely.

I had to repeat myself four times, each time providing more visual and verbal detail, including failed output screenshots — and even then it still failed. This is not a case of vague prompts. I did the work. I gave the tool everything it needed, just like I always have. But something has changed for the worse.

I don’t know what you’ve altered in the back end — but this regression is unacceptable. I relied on this level of precision for my professional video archival workflow. What used to be done in one step now takes four corrections, and even then I’m still having to finish the job myself.

This tool used to behave like a professional assistant. Now it’s like I’m dealing with someone who can’t even read a ruler on a screen. I am furious and deeply disappointed. If this isn’t resolved, I will seriously reconsider my use of ChatGPT for technical and archival work moving forward.

See the screenshot and the command versus the actual result. It gave me:

delogo=x=585:y=400:w=120:h=40:show=0

This used to work. Something has changed, and it has broken this.

When I ran it, all the generated delogo command did was smudge one side…

I provided a GIMP screenshot clearly showing the selected rectangle around the on-screen logo. The four corners were:

  • Top-left: (479, 387)
  • Bottom-left: (479, 418)
  • Top-right: (699, 387)
  • Bottom-right: (699, 418)

The frame was 720×480 — well within bounds — and everything was visible: rulers, selection box, zoom level, and canvas size.
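For reference, the arithmetic that turns those four corners into delogo parameters can be sketched in a few lines of Python. This is only an illustration: the function name `corners_to_delogo` is hypothetical, and it assumes the corner values are pixel edges as read from the GIMP rulers, so the width is simply 699 - 479 and the height 418 - 387. The bounds test is a plain containment check, not a copy of FFmpeg's own validation.

```python
def corners_to_delogo(top_left, bottom_right, frame_size, show=0):
    """Build a delogo filter string from two opposite corners of a selection.

    Assumption: corners are pixel edges, so width = x2 - x1 and height = y2 - y1.
    """
    (x1, y1), (x2, y2) = top_left, bottom_right
    frame_w, frame_h = frame_size

    # Normalise so (x, y) is always the top-left corner.
    x, y = min(x1, x2), min(y1, y2)
    w, h = abs(x2 - x1), abs(y2 - y1)

    if w == 0 or h == 0:
        raise ValueError("selection has zero width or height")
    # Simple containment check against the frame size.
    if x < 0 or y < 0 or x + w > frame_w or y + h > frame_h:
        raise ValueError("selection extends outside the frame")

    return f"delogo=x={x}:y={y}:w={w}:h={h}:show={show}"


# Corners from the GIMP selection, on a 720x480 frame.
print(corners_to_delogo((479, 387), (699, 418), (720, 480)))
# prints: delogo=x=479:y=387:w=220:h=31:show=0
```

Under those assumptions the expected box is x=479, y=387, w=220, h=31.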

Despite this, ChatGPT repeatedly:

  • Ignored or misread the selection box entirely
  • Returned incorrect x, y, width, height values
  • Suggested coordinates outside the 720px limit, which caused FFmpeg to fail
  • Failed to lock onto the visual region I clearly highlighted — even after multiple images
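To make these failures measurable, a returned box can be checked against the frame and against the selection it was supposed to match. The sketch below is a rough illustration, not part of any tool: `in_frame` and `coverage` are hypothetical helper names, boxes are `(x, y, w, h)` tuples, and it assumes the `x=585:y=400:w=120:h=40` command quoted earlier was produced for the same 479..699 by 387..418 selection.

```python
def in_frame(box, frame_size):
    """Return True if the (x, y, w, h) box lies entirely inside the frame."""
    x, y, w, h = box
    frame_w, frame_h = frame_size
    return x >= 0 and y >= 0 and x + w <= frame_w and y + h <= frame_h


def coverage(box, target):
    """Fraction of the target box's area that the given box overlaps."""
    bx, by, bw, bh = box
    tx, ty, tw, th = target
    # Width and height of the intersection rectangle (zero if disjoint).
    overlap_w = max(0, min(bx + bw, tx + tw) - max(bx, tx))
    overlap_h = max(0, min(by + bh, ty + th) - max(by, ty))
    return (overlap_w * overlap_h) / (tw * th)


frame = (720, 480)
selected = (479, 387, 220, 31)   # derived from the GIMP corners
returned = (585, 400, 120, 40)   # from the delogo command ChatGPT gave

print(in_frame(returned, frame))             # True
print(f"{coverage(returned, selected):.0%}")  # 30%
```

Under those assumptions the returned box is inside the frame but overlaps only about 30% of the selection, all of it toward the right-hand side, which would be consistent with a result that only smudges one side.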

This is a major regression from how it used to perform: before, it would consistently extract the exact box from the GIMP screenshot without issue. Now, even with all the data provided, it failed 4+ times in a row.