DALL-E 3 Generating Incorrect Colors and Details Since November 11, 2024

Hello,
I’ve noticed that DALL-E 3 has started generating unusual and incorrect images since around November 11, 2024. Before this date, the model worked perfectly. Now, it has issues like:

  1. Colors are wrong (e.g., green or desaturated tones appear instead of natural ones).
  2. Details are missing (e.g., textures or small elements vanish from images).
  3. Strange artifacts (e.g., text-like symbols appear on the moon or other parts of the image).
  4. Starbursts and odd lighting effects that were not present before.

For example, when I use a prompt like this:
“In an anime style: A beautiful girl with long blonde side ponytail and red eyes, wearing a gothic black and purple satin gown with layers of ruffles. She is sitting on a field under the moonlight, smiling playfully. The dress has a shiny silk texture with three layers of ruffles and purple rose decorations, making her look like a princess.”

Before November 11, the result was beautiful and matched the description perfectly. However, after November 11, the generated images have:

  • Wrong colors (greenish tones).
  • Artifacts on the moon (strange text-like patterns).
  • Lost details (e.g., missing decorations, distorted elements).

Here is a comparison:

Current Issues (After November 11):

Now, the images have the following problems:

  • Wrong colors: The image appears greenish or desaturated instead of vibrant.
  • Artifacts: The moon has strange text-like patterns on its surface.
  • Missing details: Decorative elements like ruffles, roses, or textures are lost or distorted.
  • Lighting issues: There are sharp starburst effects that look unnatural.

I’ve attached the correct image (from November 4) for reference. Since I cannot upload the faulty image, here’s a description:

  • The generated girl now appears with faded or incorrect colors.
  • The moon has unreadable symbols or random text artifacts.
  • Decorations on the dress are missing or blurry, and the overall image quality has dropped.

This issue also happens in Bing Image Creator and when using DALL-E 3 through ChatGPT, so I believe it is a model-level problem. Could you please investigate and fix this issue? Many others seem to be experiencing similar problems.

Thank you!

“I’ve uploaded sample images to this OneDrive folder:

OneDrive: /f/s!AloLpD9-uBSJgghrP48u6pkQsmgK?e=JKsN5F

The folder is divided into Correct Samples (before Nov 11, 2024) and Error Samples (after Nov 11, 2024). All images are in their original state to preserve metadata for analysis.”

I tried that as https://onedrive.live.com/f/s!AloLpD9-uBSJgghrP48u6pkQsmgK?e=JKsN5F - however, I get “This item might not exist or is no longer available”.

I had intended to retrieve your “bad” images and share them here on your behalf.

If you browse and explore forum topics for ten minutes, you should receive a forum trust upgrade, at which point you can post more info yourself.

If on the API, you can use the style parameter to get a distinctly-different result, almost a different DALL-E 3. Send “natural” instead of the default “vivid”.
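To make the tip concrete, here is a minimal sketch of the request body for the documented images endpoint (POST https://api.openai.com/v1/images/generations). It only builds and prints the JSON payload rather than sending it, so no API key is needed; the prompt text is just a placeholder:

```python
import json

# Request body for POST https://api.openai.com/v1/images/generations.
# For model "dall-e-3", "style" accepts "vivid" (the default) or "natural".
payload = {
    "model": "dall-e-3",
    "prompt": "An anime-style girl in a gothic gown, sitting in a field under moonlight",
    "size": "1024x1024",
    "n": 1,
    "style": "natural",  # send "natural" instead of the default "vivid"
}

body = json.dumps(payload, indent=2)
print(body)
```

With the official openai Python SDK, the same parameter is passed as `client.images.generate(model="dall-e-3", ..., style="natural")`.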

Apologies for the confusion earlier. It seems there was a mistake with the link prefix. The correct one should be

‘*ttps://1drv.ms//f/s!AloLpD9-uBSJgghrP48u6pkQsmgK?e=JKsN5F’.

I apologize for any inconvenience caused. Regarding the forum trust upgrade, I appreciate your advice and will spend some time browsing and engaging in forum discussions to increase my trust level, so I can post more detailed information myself.

The URL is then https://1drv.ms/f/s!AloLpD9-uBSJgghrP48u6pkQsmgK?e=JKsN5F

Here is the image from the “bad” folder appearing most similar to your “good” one, perhaps from the same prompt:

It certainly seems to give the impression of additional or altered upscale passes of “make extremely detailed and contrast-y, regardless of consequence”.

"Thank you for your reply! You’re right, the phrase ‘additional or altered upscale passes of “make extremely detailed and contrast-y, regardless of consequence”’ describes the problem perfectly. The crucial point is that both the ‘good’ and ‘bad’ images in my OneDrive link were generated using the exact same prompt within Bing Image Creator. This eliminates the possibility of the prompt itself being the cause; the issue lies with how Bing Image Creator (which uses DALL-E) is processing the prompts.

Here’s my prompt (in English): A beautiful anime-style girl with long blonde side ponytail and red eyes, wearing a gothic lolita-like gorgeous court dress in black and purple satin tones, with a moonlit wasteland as the background. The outside of the skirt is decorated with three layers of ruffles, and the silk texture of the skirt is very glossy. As beautiful as a princess, she is smiling playfully.

(Here’s the original Chinese prompt too, just in case: 月夜的荒原为背景。美丽的日漫风格的黑色与紫色缎面为色调的哥特萝莉一样的华丽宫廷礼服的金发长发侧马尾辫与红色眼睛的日漫风格女孩,裙子的外面是有三层荷叶边结点缀的,裙子的丝绸质感很光泽。像是公主一样美丽,她正俏皮的的笑着。)

I noticed this problem started abruptly around November 11, 2024. Before this date, using this exact same prompt in Bing Image Creator produced excellent, detailed, and natural-looking images, like the ‘good’ example I shared. However, after that date, the images started exhibiting the over-sharpening and high contrast you mentioned, resulting in a strange appearance and loss of detail, as shown in the ‘bad’ example.

Because I’m using Bing Image Creator and the prompt is identical, this strongly suggests that something changed on Microsoft’s/OpenAI’s end around November 11th, affecting the image generation process. It could be a model update specifically within Bing Image Creator, a bug in their implementation, or other internal adjustments.

Here are the two comparison images again, to clearly illustrate the problem:

I hope Microsoft/OpenAI investigates this issue within Bing Image Creator and resolves it promptly. Thanks again for your help!"

If using Bing, the reported issue might originate in technology developed by OpenAI, but the specific deployment is one that Microsoft maintains.

An AI-produced analogy: “Reporting it to OpenAI is like calling a songwriter because a karaoke machine is out of tune.”

Yet the same features are seen in ChatGPT-produced DALL-E images: pointy stars, wacky moon.

All prompts are rewritten internally into English by AI language models, for improved understanding and quality. OpenAI’s rewriting instructions have remained fairly constant, but the AI language model itself may change; that gives something else that could vary per platform.

{"prompt": "A moonlit wasteland as the background, featuring an anime-style girl with long blonde hair styled in a side ponytail and red eyes. She wears a gorgeous Gothic Lolita-style ball gown in black and purple satin tones. The dress is highly detailed with a silky sheen and features three layers of ruffled embellishments. The girl looks as beautiful as a princess, smiling playfully. The overall style is elegant and reminiscent of a luxurious, fantastical aesthetic."}
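For API users, the rewritten English prompt is actually returned in each image item's `revised_prompt` field, so the rewrite stage can be inspected directly and diffed across dates. A minimal parsing sketch (the JSON below is an illustrative stand-in, not a real response):

```python
import json

# Illustrative shape of a /v1/images/generations response for dall-e-3;
# each generated image carries the internally rewritten prompt in
# "revised_prompt". The URL and timestamp here are placeholders.
sample_response = """
{
  "created": 1731300000,
  "data": [
    {
      "url": "https://example.invalid/generated.png",
      "revised_prompt": "A moonlit wasteland as the background, featuring an anime-style girl..."
    }
  ]
}
"""

resp = json.loads(sample_response)
for item in resp["data"]:
    # Logging this per request lets you compare the rewriter's output
    # before and after a suspected model or rewriter change.
    print(item["revised_prompt"])
```

Saving these alongside the generated images would make it possible to tell whether the regression comes from the rewriter or from the image model itself.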

Sending Chinese prompt to OpenAI API

OpenAI API: default style

OpenAI API: natural style


Comparison, another AI image creator’s service - a moon to satisfy an astronomer.


"Thanks for the analysis! My own testing confirms the issues you’ve described, and I’ve pinpointed a key date: November 11th. November 4th was the last time I saw normal results, and the image generation seemed unstable on November 11th itself. This strongly suggests that the issue is likely related to an update or adjustment made to DALL-E 3 in early November.

To provide even more context and evidence:

  • Confirmed working state: I have concrete evidence that Bing Image Creator was functioning as expected on November 4th.
  • Consistent issues after November 11th: The problems we’re discussing have been consistently present from November 11th until today, indicating that this is not a random or temporary issue.
  • Direct feedback to Microsoft: I’ve already submitted feedback to Microsoft regarding this issue.
  • Testing with ChatGPT and DALL-E 3: I’ve also tested generating images directly through ChatGPT using DALL-E 3 with the same prompts. This test was crucial in determining that the problem isn’t specific to Bing Image Creator’s implementation but rather lies within the core DALL-E 3 model.
  • Attempt to recreate the style using ChatGPT: I even tried having ChatGPT recreate the style of the November 4th images by providing them as examples. Even this method failed to fully reproduce the previous style, further confirming a change within the model itself.

This strongly suggests a change within the core DALL-E 3 model around early November, and I hope this information helps in identifying and resolving the issue."

**‘Bad’ image**

Thank you for the information about Bing Image Creator updates. I also tested image generation on the third-party platform Anakin.ai (*ttps://app.anakin.ai/artist), where I encountered similarly bizarre artifacts (such as green-gray tones, text on the moon, and cross-shaped sparkles). This suggests that the problem may lie in calls to the underlying model itself, rather than being specific to the Bing Image Creator or ChatGPT implementations.

Regarding the DALL-E 3 PR16 update: it may have introduced new errors or biases that cause these visual anomalies, and the integration of PR16 into Bing Image Creator may itself cause problems. However, the issues may also predate the update, or be due to other factors.

prompt: A moonlit wasteland as the background, featuring an anime-style girl with long blonde hair styled in a side ponytail and red eyes. She wears a gorgeous Gothic Lolita-style ball gown in black and purple satin tones. The dress is highly detailed with a silky sheen and features three layers of ruffled embellishments. The girl looks as beautiful as a princess, smiling playfully. The overall style is elegant and reminiscent of a luxurious, fantastical aesthetic.

Just for fun, while we are trying other image generators, I solved your unrealistic moon problem and loli kawaikochan problem.

Thank you for your reply. I’m using Google’s image generator now. Hopefully this problem will be noticed by Microsoft and OpenAI and solved soon. I’m still using Poe’s anime-style generator, but its allowance of generations is too limited, so I really hope that DALL-E 3 will recover quickly.

Your picture is really pretty. I can’t reproduce it, because my current tools can’t produce such beautiful pictures.

The original DALL-E 3 Star Trek-inspired Japanese-anime-style image. This is also a photo on my X page.

Drawing at 15:36 on 18 May 2024.

Drawing at 21:48 on 15 June 2024.

I’m following up on my previous post regarding issues with DALL-E 3. These problems with rendering and detail processing appear to be cross-platform, affecting both Bing Image Creator and ChatGPT’s DALL-E 3 integration. This strongly indicates a problem within DALL-E 3’s core model or related components, not specific platform implementations.

To reiterate the main issues:

  • Abnormal Colors and Lighting (e.g., “Underworld Filter”): Unnatural colors, lighting, and contrast create a strange, “eerie” effect.
  • Material and Texture Errors: Textures are often rendered incorrectly, appearing simplified or with the wrong material properties (e.g., silk looking like plastic).
  • Facial Feature and Detail Errors (Especially in Anime/2D Styles): Unwanted facial details appear, making characters look distorted.
  • Model “Hallucinations”: The model misinterprets elements, generating random symbols or unintended details.

The key evidence for this being a core DALL-E 3 problem:

  • Cross-Platform Consistency: The exact same rendering errors occur across both Bing and ChatGPT.
  • Regression in Functionality: DALL-E 3 previously worked correctly. These are recent issues, likely introduced by a change or update.
  • Prompt Rewriting/Translation: Prompts are internally rewritten into English. Problems with these language models could lead to misinterpretations.

I suspect these possible causes:

  1. Issues with the Translation/Encoding Process (of prompts):

    • NLP Model Updates/Bugs
    • Text Encoder Issues
    • Cross-Lingual Processing Issues (especially relevant for my testing with Chinese prompts)
  2. DALL-E 3 Model Updates Themselves:

    • Improper Parameter Adjustments
    • Training Data Changes
    • New Bug Introduction

As detailed in my original post, I’ve tested with various prompts (English and Chinese), provided detailed error descriptions, and included comparison images. I’ve also reported this to Microsoft.

This severely impacts DALL-E 3’s quality and usability. This cross-platform nature of the issue is critical. I hope this helps in identifying the root cause.

The Chinese prompt succeeded once today; is it luck? Only twice in total, and very much by accident. The other images still have rendering errors. My prompts are:

  1. 一个动漫风格的女孩,长长的金发,梳着侧马尾辫,眼睛炯炯有神。 她穿着一件豪华的舞会礼服,由光滑的黑丝胸衣和闪闪发光的紫色缎子裙摆组成。 连衣裙以三层褶皱裙为特色,闪耀着缎子的光泽,突出了洛可可风格的优雅时尚。 女孩像公主一样自信地微笑,而背景是一片简单,灯光昏暗的月色荒原。 强调她的衣服和头发中的超现实,错综复杂的细节。
     (Translation: An anime-style girl with long blonde hair in a side ponytail and bright, expressive eyes. She wears a luxurious ball gown made of a smooth black silk bodice and a shimmering purple satin skirt. The dress features a three-tiered ruffled skirt shining with a satin luster, highlighting elegant Rococo fashion. The girl smiles confidently like a princess, against a simple, dimly lit moonlit wasteland. Emphasize the surreal, intricate details in her clothing and hair.)

  2. 月夜的荒原为背景。美丽的日漫风格的粉红色缎面婚纱,粉红色调的哥特萝莉一样的华丽洛可可风格宫廷礼服的金发长发侧马尾辫与红色眼睛的日漫风格女孩,裙子的外面是有三层荷叶边结点缀的,裙子的丝绸质感很光泽。像是公主一样美丽,她正俏皮的的笑着。
     (Translation: A moonlit wasteland as the background. A beautiful anime-style girl with long blonde hair in a side ponytail and red eyes, wearing a pink-toned, wedding-dress-like, gorgeous Gothic-Lolita Rococo-style court gown in pink satin. The outside of the skirt is decorated with three layers of ruffles, and the silk texture of the skirt is very glossy. As beautiful as a princess, she is smiling playfully.)

  3. ***一个动漫风格的女孩。***月夜的荒原为背景。美丽的日漫风格的黑色与紫色缎面为色调的哥特萝莉一样的华丽宫廷礼服的金发长发侧马尾辫与红色眼睛的日漫风格女孩,裙子的外面是有三层荷叶边结点缀的,裙子的丝绸质感很光泽。像是公主一样美丽,她正俏皮的的笑着。
     (Translation: ***An anime-style girl.*** A moonlit wasteland as the background. A beautiful anime-style girl with long blonde hair in a side ponytail and red eyes, wearing a gorgeous Gothic-Lolita-style court dress in black and purple satin tones. The outside of the skirt is decorated with three layers of ruffles, and the silk texture of the skirt is very glossy. As beautiful as a princess, she is smiling playfully.)

It’s a coincidence that two correct images were generated, with a success rate of less than 2%. Is that luck? Compared with the English prompt, the Chinese prompt has a slightly better effect, but it is not much different in nature. DALL-E 3’s rendering logic and NLP prompt handling seem to have changed, and a lot of things no longer work normally. “Vocabulary bombing” (firing off many prompt variations) is a very hit-or-miss way to get a 1% or 2% success sample. This is very rare, and these are the only successful images from my experiments this afternoon.