Describing images with GPT3

When I search all results that come back are on turning a description into an image but I want to do the opposite. I want to start with an image and have GPT3 describe to me what the image is of or even better have it build a description with added content of the surrounding text (I am processing webpages).

Is this possible?


For that you need to use OpenAI’s open source CLIP model - you can test it on replicate rmokady/clip_prefix_caption – Run with an API on Replicate

1 Like