Dear OpenAI Team,
I appreciate all the advancements you’ve made with ChatGPT and AI in general, but I’d like to express my concerns regarding DALL·E and image interpretation capabilities.
- DALL·E Improvement – Compared to competitors like Midjourney and Stable Diffusion XL, DALL·E 3 feels outdated. The image quality, realism, and consistency fall behind industry standards. It would be great to see a major update that improves fidelity, creativity, and coherence in image generation, aligning with OpenAI’s high standards.
- Image Input for ChatGPT API – It’s quite frustrating that, despite ChatGPT being able to interpret images in the web app, the API still lacks this functionality. Many developers and businesses would greatly benefit from this feature being available. Are there any plans to enable image input in the API soon?
- No Image Support for More Powerful Models – It’s disappointing that the most advanced models, like GPT-4 Turbo (o1), still do not have the ability to process images. This severely limits their usability for many real-world applications that require multimodal understanding. OpenAI is known for pushing the boundaries of AI, so it’s surprising to see this limitation persist.
I hope OpenAI will address these issues soon, as they are crucial for keeping up with the rapid advancements in AI-generated content. Looking forward to seeing improvements in these areas!