Imagen is superior with mask edits, but there’s a lot of vagueness across the various image APIs out there. It’d be helpful if OpenAI could better explain the limitations and expectations for the common use cases (VTON, style transfer, color-up, character swap, background modification, feature editing, etc.). Their mask example works fine, but the setup of the shot lends itself to details that are easy to fill in consistently.