Hello everyone,
I’m working on a project where I need to read the text on vehicle license plates and also get the bounding box coordinates of each plate in the image. When I use the GPT API, it reads the plate text accurately, but the bounding box coordinates it returns for the plate are incorrect.
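To make "incorrect" measurable, one option is to compare a returned box against a hand-labeled one using intersection over union (IoU). Below is a minimal sketch in Python. The `iou` helper, the `(x_min, y_min, x_max, y_max)` pixel format, and the sample numbers are all assumptions for illustration, not actual output from the API.

```python
def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x_min, y_min, x_max, y_max)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap; clamp to zero when the boxes do not overlap.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) boxes.
    return inter / union if union > 0 else 0.0


# Hypothetical values, for illustration only.
hand_labeled_box = (100, 200, 300, 260)  # box drawn by hand around the plate
returned_box = (150, 220, 350, 280)      # box reported for the same plate

print(f"IoU = {iou(hand_labeled_box, returned_box):.2f}")
```

An IoU of 1.0 means the boxes match exactly and 0.0 means they do not overlap at all, so averaging this over a few labeled images would give a single number to compare models or settings.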
Could anyone explain why this discrepancy occurs and how I can improve the accuracy of the bounding box detection? Are there any specific models, pre-processing steps, or API configurations that you recommend for this type of task?
Thank you for your help!