This might be off-topic, but here are some tips on how to bypass some weaknesses that DALL-E still has. It took me quite some time to realize relatively simple things. This might help save some time when experimenting.
As already seen, DALL-E cannot process negations, so always describe positive desired properties to prevent DALL-E from getting the idea of adding something unwanted.
DALL-E takes everything in the prompt and tries to implement it, even if it doesn’t make sense or is contradictory. The more complex the implementation, the more likely errors are. For example, “close to the camera” or “close to the viewer” resulted in DALL-E adding a camera or a hand to the image, instead of placing the desired element close to the viewpoint. So far, “close to us” has worked. Also, the instruction “create” or “visualize an image” sometimes leads to DALL-E adding brushes and drawing tools, even with a hand that literally creates the image. Just describe the image it self and avoid to instruct DALL-E to create/generate/visualize the image.
If a text is very short, GPT tries to expand it to make it more interesting. This is good if creativity is desired and you intentionally give up some control. You can prevent this by writing “use the prompt unchanged as entered.” And if you are not writing in English, “use the prompt unchanged as entered, and only translate it into English.”
When pushing DALL-E to creative limits, nonsensical texts suddenly appear, where DALL-E inserts the prompt into the image, probably to describe it. This has been the strangest behavior so far. You cannot get rid of it with “don’t add any text”, on the contrary, you get more text. You have to change the prompt itself.
DALL-E seems to use some templates for image generation to increase the likelihood of appealing images. Depending on where these are triggered, they are almost impossible or completely impossible to remove.
It is also good to avoid possibility forms like “should” or “could” and instead directly describe what you want to have in the image.
And, write the most important thing first, then the details, and finally technical instructions like image size, etc.
Also, geometries are not yet fully understood. For example, a snake sometimes simply consists of a closed ring. The system is still not perfect…
It is also interesting to know that even GPT does not recognize some of these weaknesses and generates prompts for DALL-E that need to be improved.
Hope this helps.