I have tried many times to get DALL-E 3 to put text in an image. For example:
draw a large file icon and write "ZIP" on it
But DALL-E 3 often renders the wrong text, such as “ZP”, “Z”, or “ZAP”; out of 10 retries, only about 4 produce the correct text. Why? How can I make DALL-E 3 produce the exact text I ask for in the prompt?
There’s a specific reference to this limitation in the DALL-E 3 paper:
5.2 Text rendering
When building our captioner, we paid special attention to ensuring that it was able to include prominent words found in images in the captions it generated. As a result, DALL-E 3 can generate text when prompted. During testing, we have noticed that this capability is unreliable as words have missing or extra characters. We suspect this may have to do with the T5 text encoder we used: when the model encounters text in a prompt, it actually sees tokens that represent whole words and must map those to letters in an image. In future work, we would like to explore conditioning on character-level language models to help improve this behavior.
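In short, the text encoder sees whole-word tokens rather than letters, so the image model has to "guess" the spelling. As a quick illustration (a sketch assuming the Hugging Face `transformers` and `sentencepiece` packages; the paper does not name the exact T5 variant, so `t5-base` here is an assumption), you can look at what the encoder actually receives:

```python
from transformers import T5Tokenizer  # requires `transformers` and `sentencepiece`

# "t5-base" is chosen for illustration only; the paper doesn't say which variant was used.
tokenizer = T5Tokenizer.from_pretrained("t5-base")
pieces = tokenizer.tokenize('draw a large file icon and write "ZIP" on it')
print(pieces)  # subword pieces, not individual characters; whether "ZIP" stays
               # one piece or gets split depends entirely on the vocabulary
```

There is no API parameter that forces exact text rendering, so the practical workaround is the one you are already doing by hand: generate several candidates and keep the one whose text is correct. A minimal sketch using the official `openai` Python SDK (v1+, assuming `OPENAI_API_KEY` is set in your environment):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = 'draw a large file icon and write "ZIP" on it'

# DALL-E 3 only accepts n=1 per request, so loop to collect candidates.
candidate_urls = []
for _ in range(5):
    response = client.images.generate(
        model="dall-e-3",
        prompt=prompt,
        size="1024x1024",
        n=1,
    )
    candidate_urls.append(response.data[0].url)

for url in candidate_urls:
    print(url)  # inspect each image and keep the one with the correct text
```

You still have to check the output yourself (or run an OCR pass over the candidates); the model cannot verify its own rendering.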