If I’ve got nothing better to do with ChatGPT Plus than make pictures, is that a problem?
Made these cute ones to see how DALL-E 3 performs on one of the prompts from the announcement of it, that prompt sent without AI rewriting to DALL-E direct.
One that was more visible in the blog no longer gives broccoli-faced tigers:
And I think the technology for making a wide or tall image is clearly on display in all I just posted and in contrast to yours. Your 1024x1024 image has contents extending off the screen, being cut off? Isn’t that a rather odd behavior?
Not if you imagine second passes of “edits” endpoint techniques that outfill the edges of a square image to make a wide image.
If the edges have no detail to be entailed, then there is nothing to continue upon:
In fact, that’s how you could even instruct the prompt, “The subject and background details extend up to and beyond the image edges.” - prompt which takes careful use not to include words like “framed” or “cropped” that are not understood but drawn:
I found that there are some cute works as well. So, I hope everyone feels encouraged to share their creations using both the API and ChatGPT.
At that time, I didn’t use the EDIT function and just sent the prompt directly to DALL-E. I found that some details are useful.
But I will put the details aside for now.
Hmm… I tried to create a playful image with a broccoli head based on a tree, but it seems to have gone wrong.
centursion
I’ll raise you an actual aquilifer
I couldn’t get an iconic looking vulpes frumentarius, but I guess they didn’t really look like anything in particular, being the roman CIA.
The OpenAI blog including DALL-E 3’s announcement and example has been decimated. Decimated might not be the right word, because “deci” as a root might imply there was even 1/10th the content remaining. Now the site is just corporate partner use cases back-dated into the blog and the DALL-E page just tells about safety considerations being taken.
So no more seeing if DALL-E will still do what it did from OpenAI’s own examples. Wayback machine also down probably due to another AI scraping campaign against it.
So I give you the best of 5 in the medium of polymer clay, with a high tendency to produce images with no content beyond the central box in others.
That has the signature appearance of having used the API parameter “natural”, which is actually symptomatic instead of natural: random objects dropped into the scene with no overall harmony of lighting or shadow, nor depiction of photographic depth or consistent sharpness. ChatGPT Plus users were getting an even worse version randomly for some of their generations.
The prompt:
"“Imagine a group of 8 explorers, women, men, and children, of varying ages and ethnicities, on a frontier, lush oil painting style, in the manner of a mid-19th century American landscape painter of the West. They have just reached a point in a mountain passage where they see a vast and beautiful land spread out before them, during the month of May.”
It had problems with number of explorers. And I had to go through some iterations until it got the multi-ethnic thing down. But the results were lovely. I ran this one through Photoshop to add an oil painting look.
I asked it about the inability to get the number of people correct. Fascinating responses.
"Generating images with an exact number of people can sometimes be challenging due to the complexity and subtlety involved in translating textual prompts into visual content. The AI uses patterns it learned during training to create images based on descriptions, but it doesn’t “count” in the human sense. Instead, it generates components of an image based on probabilities and associations from its training data.
When it comes to creating images with specific numbers of people, the AI might misinterpret the spatial and relational aspects of the prompt, leading to more or fewer figures than requested. Also, in complex scenes like group landscapes, the AI may struggle to maintain the exact number of figures while also trying to balance other elements of the composition, such as diversity and scenery. This can result in images that don’t perfectly adhere to the numerical aspect of the prompt."
"There are ongoing efforts to improve the accuracy and reliability of AI image generation, including better adherence to specific requests like the number of subjects in an image. Developers and researchers continually work on refining the models through updates and training with more diverse and detailed datasets. These updates aim to enhance the AI’s understanding of complex instructions and its ability to translate textual prompts into visually accurate representations more faithfully.
Further developments might include more sophisticated parsing of the text, better spatial understanding, and improved algorithms for image composition. These improvements could help the AI better manage specific requests like the exact number of people in a scene, making it more useful for precise tasks."
I asked it about its rendering problems when it generates a video for me.
Good stuff.
The AI model now called GPT-4 is simply a suggestible repeater of input, generating plausible completions.
(memory off; all tools disabled and empty custom instructions)
Welcome back. Hope you’ve been well.