I’m attaching scanned images of documents and would like to know more about how images are handled behind the scenes.
For us, a typical scanned image is about 2550x3300 pixels, but my understanding is that images are resized to something like 1200x750, which means a document page that was 3300 pixels tall is now only 750.
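For concreteness, here is a minimal sketch of the kind of aspect-ratio-preserving downscale I assume is happening. The target of 768 for the shorter side is my assumption, not a documented figure; the point is just how much height a 2550x3300 page loses under any such cap:

```python
def fit_short_side(w, h, target=768):
    # Assumed resize rule: scale the image so its SHORTER side
    # equals `target`, preserving aspect ratio. The 768 value is
    # a guess at the model's working resolution, not documented fact.
    scale = target / min(w, h)
    return round(w * scale), round(h * scale)

# Our typical 2550x3300 page:
print(fit_short_side(2550, 3300))  # → (768, 994)
```

Either way, a full page ends up with far fewer pixels per line of text than the original scan had.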
Some of these are contracts over 100 years old, and both the document and the scanned image can be poor quality, so good resolution is critical. When the height is reduced to 750, we start getting a lot of mistakes.
First, I would like to know what actually happens to the image; then I'm considering rotating the document 90 degrees to better match the target dimensions.
Thanks in advance for any information you can offer.
Thanks. How about if I create the tiles myself, so each tile is 768x768, and instruct GPT how to assemble them? That little bit of extra resolution makes a big difference in accuracy. Since we are processing birth certificates for a county, getting the child's name correct is essential.
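For what it's worth, here is a rough sketch of how I'd compute the tile boundaries myself. The 768x768 tile size mirrors the resolution I mentioned above; letting the last row/column be smaller (rather than padding) is my own choice, not a requirement of any API:

```python
def tile_boxes(w, h, tile=768):
    # Hypothetical tiling scheme: cover a w x h image with tile x tile
    # crop boxes (left, top, right, bottom). Edge tiles are simply
    # smaller instead of being padded out to the full tile size.
    boxes = []
    for top in range(0, h, tile):
        for left in range(0, w, tile):
            boxes.append((left, top, min(left + tile, w), min(top + tile, h)))
    return boxes

# A 2550x3300 page at full resolution would need 4 columns x 5 rows:
print(len(tile_boxes(2550, 3300)))  # → 20
```

Each box could then be passed straight to something like Pillow's `Image.crop`, which takes exactly this (left, top, right, bottom) tuple, and the crops sent as separate images.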
FYI, I was able to split a document image into top and bottom halves, so the dimensions fit the horizontal dimensions that GPT resizes images to much better, and everything was optimized for better resolution. GPT combined the two images with no problem, and we ended up getting much better results. This leads me to believe that, if we needed to, we could tile the images into n tiles ourselves, so GPT would effectively be working with a very high-resolution image.