Dalle3 prompt to generate pencil sketches keeps including pencils in image

ideplo · March 12, 2024, 1:01am

Hello, any tips on how to get Dalle3 not to include pencils in the output image? I have a simple prompt like so:

Create a black and white hand-drawn image with a No. 2 pencil effect, featuring a futuristic car.

Thanks in advanced for any help.

PaulBellow · March 12, 2024, 1:06am

This is part of the problem, I think. We know what it means, but sometimes the model takes it literally instead of the style of sketch…

Try using other words. It still pops up occasionally for me as my prompts are auto-generated, but you can work around the limitation.

anon22939549 · March 12, 2024, 2:05am

Sometimes “natural-language” can just get in the way and it’s better to respect the model’s limitations and speak to it in its own language as best you can.

Using the Dall-E tool, please generate an image in accordance with this JSON-formatted specification. 

```json
{
  "subject": "futuristic car",
  "style": [
    "hand-drawn",
    "no. 2 pencil",
    "greyscale"
  ]
}
```

Result,

ideplo · March 12, 2024, 2:39am

I was able to get around the problem by doing this: Hand-drawn graphite illustrations. Thanks everyone for the tips

ideplo · March 12, 2024, 2:40am

Is the style param in AzureOpenAI Dalle3 api?

anon22939549 · March 12, 2024, 2:49am

Lol, it’s not a thing at all. It’s just something I used to ensure ChatGPT understood the difference between subject and style.

This is the prompt ChatGPT wound up sending to the Dall-E 3 model,

A futuristic car depicted in a hand-drawn style using a no. 2 pencil, presented in greyscale. The image should capture the sleek and advanced design of the vehicle, highlighting its innovative features and the smooth, aerodynamic shapes that define its silhouette.

PaulBellow · March 12, 2024, 2:54am

Looks like hand-drawn style is better than hand-drawn image.

And yeah, you can force prompts too, but you’re still under the Terms of Service. I occasionally still get blasted by moderation for innocent things.

anon22939549 · March 12, 2024, 3:48am

You hit on something I have written about here before,

Using specific and accurate art terms tends to improve performance.

PaulBellow · March 12, 2024, 4:20am

DALLE2exp was a lot better at fine-controls… Like, for example, the difference between

“pencil drawing”
“pencil sketch”
“rough sketch”
“napkin doodle”

DALLE3 tends to lean toward the “best” unless you guide it.

One of the problems I’ve seen with a lot of people new to the tech is that they’ll just pop in “picture of a house” and expect the model to read their mind on the rest. DALLE3 is better at this as it rewrites the prompt and fleshes it out, BUT it can lead to unforeseen/unwanted results.

So, I try to recommend being as detailed in the prompt as you can be. It’s quite large in the API. I don’t remember the exact number of characters off-hand, but it was quite high.

wclayf · March 12, 2024, 4:47am

Sounds like you’ve solved this already, but I might add that in my experience any concept you put in the prompt like “pencil sketch” the AI will try to convey that in the image as if you want the image itself to convey that it’s a pencil drawing and so the best way to do that is show a pencil in the image.

In general, everything that’s mentioned will normally attempt to be included, so if you mention a pencil it will try to include a pencil. But if you say “pencil-styled sketch” it knows you mean stylistically and not a literal pencil.

ideplo · March 12, 2024, 9:13pm

Guys you’ve been a tremendous help, thanks again.

I have another question; how do you prevent Dalle3 from distorting faces? It seems like it’s fine 2 out of 5 times using the same prompt.

PaulBellow · March 12, 2024, 9:24pm

If you concentrate on one or two people it usually does better (or if you include details), but it’s still rough at times. A lot better than DALLE2. Some of those generations still give me nightmares!

Seriously, though, you can play with the prompting, but you’ll still get misses…

wclayf · March 12, 2024, 9:30pm

I’m just making a wild guess, but one term I’ve seen used in prompts is “photorealistic”, but it may not work with your particular issue. I’ve noticed faces in crowds get quite distorted a lot, and I agree with PaulBellow that the fewer people there are in an image the better their faces look.

PaulBellow · March 12, 2024, 9:39pm

And if you don’t supply numbers (two people, one person, one cake, etc) it will sometimes try to fill the entire image… Again, it comes down to being specific in your original prompt and hoping edits/inserts don’t mess it up too much.

I still occasionally get a South Asian Half-Orc or similar lol

wclayf · March 12, 2024, 9:53pm

Yeah one way to even reduce the size of a person’s face in an image, is to specify other things that you want in the image, like objects in their surroundings. Since the LLM is going to have to fit those other things in the surroundings it will necessarily have to shrink the face to smaller than it might have otherwise been. So yeah, providing lots of details is key to getting good images.

michaelruddock · March 14, 2024, 9:35am

So are you saying that I have to use the API and make a JSON request to get this to work?

ideplo · March 14, 2024, 2:49pm

I’m not saying that at all. I mentioned I’m using the api. Replace no. 2 pencil with the below to see if that helps.

trenton.dambrowitz · March 14, 2024, 3:05pm

@michaelruddock It works just fine in the ChatGPT interface, you might have to tweak it around a bit though

michaelruddock · March 14, 2024, 7:12pm

@ideplo OK thank you, understood.
@trenton.dambrowitz thanks very much.

_j · March 14, 2024, 8:05pm

It is also down to turns of phrasing:

Topic		Replies	Views
Differences between Image Generation using API and ChatGPT API gpt-4 , image-generation	13	8083	June 25, 2024
API Image Generation in Dall-E-3 changes my original prompt without my permission API dalle3 , tp-1	28	32456	February 6, 2024
GPT Image is so dull and uninspiring vs Dall-E3 Community image-generation	7	769	April 28, 2025
Gpt-image-1 problems with mask edits Bugs api , image-generation , gpt-image-1	44	1432	May 15, 2025
DALLE3 Gallery for 2023/2024: Share Your Creations Community chatgpt , dalle3 , gallery	1179	68311	January 10, 2025

Dalle3 prompt to generate pencil sketches keeps including pencils in image

Related topics