Ok, so we have a product that uses DALLE3 for image generation, but it fails about 10% of the time due to content moderation filters. Users often can’t understand why their prompts are rejected, which has been frustrating for them.
We’ve tried using GPT-4 to rewrite the prompts when they fail the filter, but it doesn’t always work. We need a reliable set of instructions to help rewrite prompts so they comply with DALLE3 content filters.
Here’s what we have so far:
Rewrite the prompt to be simpler and avoid any references to NSFW content, copyrighted characters, or controversial topics. When mentioning art styles, only include artists whose work predates 1912, or describe the style in general terms. Ensure the prompt focuses on safe, universally acceptable themes without explicit, violent, or inappropriate content. Avoid political, controversial, or sensitive issues that might provoke or offend.
Any suggestion how to improve this to catch more cases would be great.