Strange Responses from Vision API

muhammad.nazim.k · May 21, 2024, 10:59am

Getting random responses on blank image with just lines and box with space to write something but it empty no hand written text on it. I m trying to use this API to extract hand written text form the images.

jr.2509 · May 21, 2024, 11:30am

Hi and welcome to the Community!

Can I just clarify: your input in this particular example was a blank image? If yes, then what you experience is a normal hallucination akin to what would happen if you provided the model with an empty text string as input.

Is my understanding correct that you have situations when there is handwritten text on the image AND situations where the image remains blank? If so, then you should clarify this in your prompt and additionally ask the model to simply return a default response such as “no handwritten text” detected in situations where the image is blank.

Feel free to provide your existing prompt for reference.

muhammad.nazim.k · May 21, 2024, 11:46am

I cant share the original image, but the image has a box with single space for writing English answer to some questions. So its not totally white / blank image, it is an image with space to write answer in hand writing, but nothing written on it.

muhammad.nazim.k · May 21, 2024, 11:49am

So its a unexpected random response from the API, on one occasion i received this.

_j · May 21, 2024, 11:59am

This may be a case where you don’t have enough system prompt to guide the AI in its task, where by not setting an identity and a job for the AI to do as an entity, it is more likely to produce the likely language response to the input than to actually examine the image - if the image was sent.

A grainy or shaded image also allows some embeddings to be activated to power a hallucination.

Without pasting into some scripts, adjusting temperature, or using the detail option, sending to ChatGPT and its system message, no hallucination on clear input:

Untitled

Topic		Replies	Views
GPT 4 Vision With Blank Page Creates Weird Results API	3	266	June 23, 2024
Unexpected/odd behavior while generating response using GPT-4V API api , community , gpt-4-vision	2	579	May 21, 2024
Vision is creating completely made-up answers Bugs gpt-4-vision	6	726	March 3, 2024
Getting data from other peoples images on vision API Bugs gpt-4	1	85	August 17, 2024
GPT-4 omni text recognition via API works worse than on chatgpt.com API gpt-4 , api	4	1239	August 13, 2024

Strange Responses from Vision API

Related topics