Issues when using dall-e-3 model

chcw · January 8, 2024, 3:24am

Hi,

I am using dall-e-3 via the API and have run into some issues:

1. Sometimes the image is filled with many objects, even though I have added the text “Do NOT put too many objects to the image.” to the prompt. Why?
2. There are always typos in the text inside the image. For example, “Access” is misspelled as “Acess”, and so on.
3. I am using the 1792x1024 size. Sometimes the image appears cropped, and some objects are cut in half.

How can I solve these issues?

Thank you

PaulBellow · January 8, 2024, 7:11am

1. Negative prompts, i.e. telling it NOT to do something, are hard for an LLM to follow. When it adds a lot of items/people, it usually means your prompt is not detailed enough, so it fills in all the space it can. Be specific about the number of items/people in the scene, so it knows what you want. It’s not a mind reader (yet!)
2. That it can do text at all is amazing, but yes, text rendering still has a lot of flaws and likely shouldn’t be used in production settings.
3. This is sometimes caused by the prompt (even a single word, which can be hard to pin down)… From what I’ve heard, the DALLE team knows about this (and other) problems and is working on them diligently.
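
To make the first point concrete, here is a rough sketch of building a prompt that says what should be in the scene, with an exact count, instead of what should not. The helper names (build_prompt, generate_image) and the exact wording of the prompt are my own assumptions, not anything official, and the sentence about margins is just a guess at something that might reduce cut-off objects. The API call assumes the openai Python package (v1 or later) and an OPENAI_API_KEY in your environment.

```python
def build_prompt(subject: str, items: list[str], setting: str) -> str:
    """Describe the scene positively: name every object and state the exact count.

    The wording below is illustrative only; adjust it to your own use case.
    """
    count = len(items)
    noun = "object" if count == 1 else "objects"
    return (
        f"{subject}, set in {setting}. "
        f"The scene contains exactly {count} {noun}: {', '.join(items)}. "
        "The background is plain and empty. "
        # Assumption: asking for margins may help with objects cut off at the edges.
        "Every object is fully visible and centered, with wide empty margins on all sides."
    )


def generate_image(prompt: str, size: str = "1792x1024"):
    """Send the prompt to dall-e-3 and return (image URL, revised prompt)."""
    # Imported here so build_prompt can be used without the package installed.
    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.images.generate(
        model="dall-e-3",
        prompt=prompt,
        size=size,
        n=1,  # dall-e-3 generates one image per request
    )
    image = response.data[0]
    return image.url, image.revised_prompt


# Example usage (not run here because it calls the API):
# prompt = build_prompt(
#     "A flat illustration of a desk",
#     ["a laptop", "a coffee mug"],
#     "a quiet home office",
# )
# url, revised = generate_image(prompt)
# print(revised)
```

Printing the revised prompt is worth doing, since dall-e-3 rewrites your prompt before generating, and comparing the two can show where extra objects or an awkward framing came from.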
