I am passing a JSON schema for the output to the image API, asking for a list of objects with their descriptions and bounding boxes. The bounding boxes I get back are somewhat related to the ground truth, but very loose. In Azure Cognitive Services I can get accurate bounding boxes of detected objects (but no …
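
For readers who want something concrete, here is a minimal sketch of what such an output schema and its post-processing might look like. The field names (`objects`, `description`, `box`), the normalized `[x_min, y_min, x_max, y_max]` convention, and the helper `to_pixel_boxes` are all assumptions made for illustration; the post does not show the actual schema, and this sketch does not check which schema keywords a given API accepts.

```python
import json

# Hypothetical schema: the field names and the normalized
# [x_min, y_min, x_max, y_max] box convention are assumptions for illustration.
DETECTION_SCHEMA = {
    "type": "object",
    "properties": {
        "objects": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {
                    "description": {"type": "string"},
                    "box": {"type": "array", "items": {"type": "number"}},
                },
                "required": ["description", "box"],
                "additionalProperties": False,
            },
        }
    },
    "required": ["objects"],
    "additionalProperties": False,
}


def to_pixel_boxes(raw_json, width, height):
    """Parse a reply shaped like DETECTION_SCHEMA and scale boxes to pixels."""
    results = []
    for obj in json.loads(raw_json)["objects"]:
        box = obj["box"]
        if len(box) != 4:
            continue  # skip malformed entries instead of guessing
        x0, y0, x1, y1 = box
        # Clamp to [0, 1] and order the corners, since model output can be sloppy.
        x0, x1 = sorted(min(max(v, 0.0), 1.0) for v in (x0, x1))
        y0, y1 = sorted(min(max(v, 0.0), 1.0) for v in (y0, y1))
        results.append({
            "description": obj["description"],
            "box_px": (round(x0 * width), round(y0 * height),
                       round(x1 * width), round(y1 * height)),
        })
    return results
```

Asking for coordinates normalized to the image size, rather than raw pixels, is a design choice in this sketch: it keeps the reply independent of any resizing applied to the image before the model sees it.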

Finding accurate location of objects from image API?

PianoGamer March 23, 2025, 1:01am 2

For comparison, these are the results from Azure Cognitive Services
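
One way to put a number on "very loose" is intersection over union (IoU) between each model-returned box and the closest box from a reference detector. The sketch below is only an illustration: the function names `iou` and `best_iou_per_box` are made up here, boxes are assumed to be `(x_min, y_min, x_max, y_max)` tuples in the same pixel space, and the coordinates in the usage note are invented, not taken from the results above.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    ax0, ay0, ax1, ay1 = box_a
    bx0, by0, bx1, by1 = box_b
    # Width and height of the overlap rectangle, zero if the boxes are disjoint.
    inter_w = max(0.0, min(ax1, bx1) - max(ax0, bx0))
    inter_h = max(0.0, min(ay1, by1) - max(ay0, by0))
    inter = inter_w * inter_h
    union = (ax1 - ax0) * (ay1 - ay0) + (bx1 - bx0) * (by1 - by0) - inter
    return inter / union if union > 0 else 0.0


def best_iou_per_box(model_boxes, reference_boxes):
    """For each model box, return its highest IoU against any reference box."""
    return [
        max((iou(m, r) for r in reference_boxes), default=0.0)
        for m in model_boxes
    ]
```

For example, a model box of `(0, 0, 20, 20)` around a reference box of `(5, 5, 15, 15)` gives an IoU of 0.25: the object is inside the box, but the box is four times too large.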
