Can GPT 4o mini model understand multiple images?

AaryaJake · September 18, 2024, 7:02am

I have extracted a set of 40 key images from a video. Can the 4o mini model accept 40 images, identify the top 5 images in accordance with the instructions, and provide me with the relevant images?

Is there another way to get the best images from the ones that have been provided?

AaryaJake · September 18, 2024, 7:05am

Can it understand the provided images in sequence?

jr.2509 · September 18, 2024, 7:07am

Hi!

For a number of reasons this would not be possible. You would fast exceed the token limit for gpt-4o-mini with that amount of pictures. Additionally, the model would struggle to analyze 40 pictures in a single API call.

If I was in your place, I would not provide more than 2 pictures for a given request; ask the model to return a description of the picture (or whatever it is you need as a basis to make a selection). Then combine all these outputs and run a final API call to make the selection of the top 5 pictures based on the relevant criteria.

Topic		Replies	Views
How can i ask multiple questions for a set of images uploaded to gpt4 vision API gpt4-vision	0	1023	December 12, 2023
How to best work with 100s of images API gpt-4	0	1448	January 17, 2024
Maximum number of images in a GPT-4V request? API gpt-4 , gpt-4-vision	5	10859	November 17, 2023
Gpt 4o can only take 39 images? Bugs gpt-4o	2	5239	January 4, 2025
Is it possible to classify the image with the multiple image input in a single API call in gpt 4 vision model? Please help this usecase API gpt-4 , chat-with-images , gpt-4-vision	4	2682	March 8, 2024

Can GPT 4o mini model understand multiple images?

Related topics