Issue with License Plate Text Recognition and Bounding Box Coordinates using GPT API

luquepedro · May 28, 2024, 12:08am

Hello everyone,

I’m working on a project where I need to recognize the text from vehicle license plates and also determine the bounding box coordinates of the license plates in images. When I use the GPT API, it accurately recognizes the text from the license plates. However, it fails to provide the correct coordinates for the bounding box of the license plate.

The API correctly reads the text on the license plate, but the coordinates it returns for the bounding box are incorrect.

Could anyone explain why this discrepancy occurs and how I can improve the accuracy of the bounding box detection? Are there any specific models, pre-processing steps, or API configurations that you recommend for this type of task?

Thank you for your help!

N2U · May 28, 2024, 12:37am

Hey and welcome to the forum!

This is an expected limitation, you can read more about the limitations here:

https://platform.openai.com/docs/guides/vision/limitations

I’ll recommend using YOLO for automatic number plate recognition and segmentation instead.

luquepedro · May 28, 2024, 1:52am

Thank you for the recommendation, I will look into it.

dekret.roman · June 11, 2024, 8:45pm

Hi Luquepedro,

I’m not answering your question but just out of curiosity, how did you manage correctly recognising licence plates? My implementation recognising plates very differently all the time, accuracy is not that good.

Thanks for any hint

Topic		Replies	Views
Gpt 4 api help error in response API gpt-4 , api	0	226	March 31, 2024
GPT4 V Object detection bounding box value incorrect Prompting gpt-4 , gpt-4-vision	1	1554	June 29, 2024
How to solve the problem that GPT-API cannot read text using OCR? API	19	2627	July 10, 2024
GPT 4 Vision Model misrepresentation of text from an Invoice (OCR Task) API gpt-4	4	1147	July 31, 2024
Vision API flips numbers on extracting text from image Bugs	3	1031	December 13, 2023

Issue with License Plate Text Recognition and Bounding Box Coordinates using GPT API

Related topics