Is there any effective way to prevent DALL-E from adding fonts/text/characters into the images?
I know negative prompts aren’t working well, so what is a good way to deal with this?
Have you tried using the terminology “image only without typography”?
Yes, but sadly it has zero effect on the outcome. For example, here’s the image it generates for the following prompt:
Blockquote Create a vector artwork image in a 1.91:1 aspect ratio and in 1200 x 628 pixels, for a blog post paragraph for a blog post discussing the Introduction to auto loans and covering the topic of auto loans. The image should be image only without dypography, and it should contain at least one person or object related to the paragraph title. The image should contain only vector illustrations. Use primarily a blue color palette. Use a clean, solid white background.
A quick fix for DALL-E text issues is to refine your prompts.
For instance, ask ChatGPT to describe a specific image focusing on visuals.
Then, edit this description to exclude text references and use it as a DALL-E prompt.
While this workaround can be effective, it’s a temporary solution.
Think creatively about instructing ChatGPT to generate unique images for each topic.
Example Prompt:
- Describe: “Describe image [number] or use generation ID [ID].”
- Edit: Remove references to personalities, brands, and text-generating elements.
- New Prompt: Form a prompt for DALL-E based on this edited description.
- Generate: Submit the prompt to DALL-E.
Try adding these 3 rules to your prompt:
- Focus on specific, visually representable elements.
- Describe actions and scenarios rather than abstract concepts.
- Avoid ambiguous language that could be interpreted as including text.
It helps! like tendencies of text is probably 10%
Hi, wondering if you managed to find sustainable solution for this?
For me also “do not include any text or letters” didn’t effect, my 2 cents is focusing only on the object rather then usage, assuming that by specifying “for blog post” (or “for a travel journal” in my case) makes model provide a “complete” solution as (again, assumption) these types of media often use text as well for emphasizing the idea through the artwork
Yes, similarly to what you suggested, the best solution I found was to ask GPT4 to write a new visual prompt for DALL-E while avoiding any mention of books, signs, titles, etc, and then use this prompt for DALL-E.
As long as the prompt doesn’t mention anything “blog” related or anything that assumes the use of text in the image, then it usually works fine.
Yeah, when you try to use a negative prompt (do not do this) it usually doesn’t work very well. Better to focus on what word is causing it… sounds like blog or journal could be it for you.
It’s very disappointing that negative prompts have no use yet on DALL-E 3. Stable Difussion uses negative prompts very accurately… Hopefully, DALL-E 4 fixes this.
As said above, I can also confim that “character” word seems much more effective than others. I appended existing prompt with something like “Do not use any character on image” and the results are much better now. Though still running into some images with letter-ish lines, but at least not full text
Working on creating a reusable prompt that keeps DALL-E 3 focused:
You are a famous artist who has been asked to create an original image that appeals to a toddler based on the following criteria:
- Main Theme: a sweet and friendly animal
- It should be extremely minimalistic with only shapes, colors and lines so that it can easily be replicated within a few minutes.
- It should only contain the “Main Theme” and no other elements in the foreground, background or surrounding space.
- It should contain the “Main Theme” only once with no margins above, below or on either side.
- The “Main Theme” should consume the entire 1024x1024 space.
- It should not divide the “Main Theme” into separate parts of the image nor imply any variations of it.
- It should not contain any text, labels, borders, measurements nor design elements of any kind.
- The image should be suitable for digital printing without any instructional or guiding elements.
Another example:
You are a famous artist who has been asked to create an original image that appeals to an adult based on the following criteria:
- Main Theme: a sport or athlete
- It should be colorful, realistic, minimalistic, and somewhat of a challenge to replicate.
- It should only contain the “Main Theme” and no other elements in the foreground, background or surrounding space.
- It should contain the “Main Theme” only once with no margins above, below or on either side.
- The “Main Theme” should consume the entire 1024x1024 space.
- It should not divide the “Main Theme” into separate parts of the image nor imply any variations of it.
- It should not contain any text, labels, borders, measurements nor design elements of any kind.
- The image should be suitable for digital printing without any instructional or guiding elements.