GPT4-vision model counting problem

I noticed that gpt4 vision hallucinates when it comes to couting objects in an image. How to solve this problem?

1 Like

To be honest, the best way to solve that problem is to use a traditional computer-vision approach such as an object detection model. GPT-4 is an incredible tool, but sometimes you need to go back to the basics.

One thing you could investigate is using a “classic” computer vision model in tandem with GPT-4-Vision as discussed in this research paper.