Even though the model is capable of distinguishing objects well, it doesn’t actually see them physically as humans do. Even humans, who physically see, might perceive something as four pieces if they haven’t read what you wrote or don’t have sharp vision. Generally, the model operates based on its fundamental capabilities unless given specific instructions.
The term ‘observation’ for humans involves a single method of seeing combined with cognitive processing of what is seen. However, the model uses different methods of ‘seeing,’ such as edge detection, and then processes what it perceives.
You might add instructions to guide responses in various situations, compensating for the lack of specific training data input.
Current models like ChatGPT are intelligent enough to interpret words as actions that compensate for necessary information, but they are not smart enough to independently determine what to do to correct or solve an issue.