Can we use images with GPT-3?

If I have a document that describes something with pics, how should I go about using the image? For example, if I want to explain how to change a headlight, I want to describe:

  1. Open the hood
  2. Look for the bulb (It should look like this image)
  3. Turn the bulb counterclockwise 90 degrees. At this point it should look like this image.

GPT-3 does not have that capability. You will need to use a different model. I’m not sure what’s publicly available, but there is probably something. If Google releases the stuff they’ve been working on to the public, you’ll really be in business.


I’m planning on doing experiments with mixing GPT-3 and DALLE. Basically I want GPT-3 to generate a story and then describe each panel of a storyboard and then feed it into DALLE. I’ll be doing that once I get back from my vacation.
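A minimal sketch of that pipeline, assuming GPT-3 returns the storyboard as a numbered list (the prompt format and panel parsing below are my assumptions, not a tested setup). The actual API calls are only indicated in comments so the parsing step stands alone:

```python
# Sketch: GPT-3 writes a numbered storyboard, each panel description is
# split out and would then be fed to DALL-E as an image prompt.
import re

def parse_panels(completion_text):
    """Split a numbered storyboard completion into per-panel prompts."""
    panels = re.findall(r"^\s*\d+[.)]\s*(.+)$", completion_text, flags=re.M)
    return [p.strip() for p in panels]

# Stand-in for a GPT-3 completion of the story/storyboard prompt:
story = """1. A knight stands at the castle gate at dawn.
2. The knight rides through a misty forest.
3. A dragon circles above a ruined tower."""

for prompt in parse_panels(story):
    # image = generate_image(prompt)  # hypothetical DALL-E call per panel
    print(prompt)
```

The point of the numbered-list format is just that it is easy for the model to follow and easy to split deterministically afterwards.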


Very interesting! I need to go through your videos, because I don’t really “get” it yet…

The JavaScript playground is perfect for this.
Sorry, I forgot the URL.

I don’t have access. I just joined the waitlist.


Hello, I think this approach can work:

They use GPT-2 to generate candidate captions for images, and CLIP to steer the generation so the captions match the images (no image is fed into GPT-2; the images are only input to CLIP).
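The core of that idea is a rerank: the language model proposes captions, and CLIP scores each caption against the image by embedding similarity. A toy sketch, where the embedding vectors are stand-ins (in practice they would come from CLIP’s image and text encoders):

```python
# Sketch of CLIP reranking: pick the candidate caption whose embedding
# has the highest cosine similarity to the image embedding.
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

image_emb = [0.9, 0.1, 0.2]          # stand-in for a CLIP image embedding
candidates = {                        # stand-in for GPT-2 caption samples
    "a cat on a sofa":   [0.88, 0.12, 0.25],
    "a dog in the park": [0.10, 0.95, 0.05],
    "a city at night":   [0.20, 0.15, 0.90],
}

best = max(candidates, key=lambda c: cosine(image_emb, candidates[c]))
print(best)
```

The published approaches go further and guide GPT-2 token by token rather than reranking whole captions, but the similarity signal is the same.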


I wish. I am fine-tuning DaVinci to resemble a character, and if I could show it images and videos, its personality would become even more accurate.

I think I have seen SVGs being generated somewhere with the text-only GPT-3.
There were some impressive demos, but I never got it to work myself. Maybe it could work?
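The idea works at all because SVG is plain text, so a text-only model can emit it directly. A minimal sketch: the completion below is a hand-written stand-in for what GPT-3 might return, and the only processing is checking that it parses as XML before trying to render it:

```python
# Sketch: treat a model completion as SVG markup and validate it as XML.
import xml.etree.ElementTree as ET

# Stand-in for a GPT-3 completion to a prompt like "Draw a gold circle as SVG:"
completion = """<svg xmlns="http://www.w3.org/2000/svg" width="100" height="100">
  <circle cx="50" cy="50" r="40" fill="gold"/>
</svg>"""

root = ET.fromstring(completion)  # raises ParseError if the SVG is malformed
print(root.tag)
```

In practice a validation step like this matters, because the model will occasionally emit unbalanced tags.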