Note: Click the links and then download any image below if you want to see it’s full resolution, most are close however any image that has ex. 16 image stuck together left-to-right, those are just badly shrunken, do download those ones if interested in them.
Note: small GLIDE, made by openAI, below, is impressive even when being the smaller version which was only trained on 67-147 million text-image pairs or so, not 250M like the real GLIDE, and is 10x less parameters (300 million). Keep in mind I let it often take half an image as input and extend it by the other half, 128 pixels, but if it used all but the last row of pixels to predict the next row, it’d have been even more accurate !
Using text prompts and choosing which completion I liked, I made this by stitching them together (it only could be fed a square image, but still came good!):
No Text Prompt — extended all around:
Text & Image Prompts — elongated (scroll down page)
“tigers in river and forest” + above
= download-16 — ImgBB
“pikachu” + image
= download-6 — ImgBB
“fancy complex detailed royal wall gold gold gold gold”
“gold gates on clouds shining laboratory”
“gold bricks lined up in a room”
“gold dragon statue with wings and breathing fire”
GLIDE also works with no text prompt, it does fine, just ~2x worse maybe:
–no text prompts–
You can compare one of these to NUWA’s:
To use GLIDE, search Google for github glide openai. I use it in kaggle, as its faster than colab for sure. You must make an account then verify number then open this in colab and only then can you see on right side the settings panel and in there u need to turn on GPU and internet. Upload images to right side top Upload, and then in the image calling part of the code that says ex. grass.png you put there simply ex. see i have:
source_image_256 = read_image(’…/input/123456/tiger2.png’, size=256)
source_image_64 = read_image(’…/input/123456/tiger2.png’, size=64)
To control the mask change the 40: thingy to ex. 30 or 44. To control the mask sideways, add another one ex. [:0, :0, :30, :30] or something like that if I got it wrong, you just add one to the end i mean haha. Apparently you can add more than 1 mask (grey box) by doing ex:
Batch size sets the number of images to generate.
Once it is done, click console to get the image and right click it to save it.
Here’s mine for minDALL_E (this one had no image prompt allowed. So, just text.)
minDALL-E was only trained on 14 million text-image pairs. OpenAI’s was trained on 250M. And the model is only 1.5 billion parameters, ~10x smaller.
“a white robot standing on a red carpet, in a white room. the robot is glowing. an orange robotic arm near the robot is injecting the robot’s brain with red fuel rods. a robot arm is placing red rods into the robot brain.”
3 dancing robot pikachu lined up on skate boards on the road in front of the mall and a firetruck under the sun wearing blue helmets and red boots, while holding umbrellas and surrounded by electric towers. realistic photo.
“3 pikachu standng on red blocks lined up on the road under the sun, holding umbrellas, surrounded by electric towers”
download-1-min — ImgBB
“box cover art for the video game mario adventures 15. mario is jumping into a tall black pipe next to a system of pipes. the game case is red.”
an illustration of a baby capybara in a christmas sweater staring at its reflection in a mirror
an armchair in the shape of an avocado. an armchair imitating an avocado.
an illustration of an avocado in a suit walking a dog
pikachu riding a wave under clouds inside of a large jar on a table
a living room with 2 white armchairs and a painting of a mushroom. the painting of a mushroom is mounted above a modern fireplace.
a living room with 2 white armchairs and a painting of the collosseum. the painting is mounted above a modern fireplace.
pikachu sitting on an armchair in the shape of an avocado. pikachu sitting on an armchair imitating an avocado.
an illustration of pikachu in a suit staring at its reflection in a mirror
“a cute pikachu shaped armchair in a living room. a cute armchair imitating pikachu. a cute armchair in the shape of pikachu”
To use it, go to this link below, make a kaggle account, verify phone number, then in this link below, click edit it, then go to setting panel at right and turn on GPU and internet. Then replace the code below, it’s nearly same but makes it print more images. If you don’t, it doesn’t seem to work good.
images = images[rank]
n = num_candidates
fig = plt.figure(figsize=(6int(math.sqrt(n)), 6int(math.sqrt(n))))
for i in range(n):
ax = fig.add_subplot(int(math.sqrt(n)), int(math.sqrt(n)), i+1)