Describing images with GPT3

When I search all results that come back are on turning a description into an image but I want to do the opposite. I want to start with an image and have GPT3 describe to me what the image is of or even better have it build a description with added content of the surrounding text (I am processing webpages).

Is this possible?

2 Likes

For that you need to use OpenAI’s open source CLIP model - you can test it on replicate rmokady/clip_prefix_caption – Run with an API on Replicate

2 Likes

Is it possible to use GPT-4 to describe images?

Welcome to the forum!

No, currently GPT04 can only deal with text. Imagine ingestion is a future release, no timeline for that yet.

1 Like

Ok thanks.
So what we can do in order to describe an image using Open AI API? Do you suggest any third party tool ?

You might take a look at technology like BASIC-L and CoCa, there are lots of image classification models out there. A ready build one would be the Microsoft Image Processing API, there are others.

1 Like