I am truly grateful for the release of the DALL-E 3 API. Since its launch, I have been conducting numerous experiments daily.
When generating images of size 1024x1792, I frequently encounter an issue where the output is a horizontal image rather than a vertical one. Is there a deliberate way to avoid this? I am curious if there is a method of prompt engineering or any specific instructions that could help. Alternatively, could this be an inherent issue stemming from the training phase of the model?
Attempts to direct the model with phrases like “vertical image” or “full-length vertical image” have not been very effective. Since this occurs quite often, I am eager to discover a solution.
We talked about this on Discord during the Alpha. They’re aware of it and working on a fix. I’m not sure exactly what causes it, but if you change the prompt slightly, you can sometimes find the trigger word. Another idea is to start a new thread as soon as it happens in the current one…
ETA: I’ve found “portrait” and “landscape” orientation helpful!
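In case it helps anyone, here is a rough sketch of how the orientation wording could be tied to the requested size so the two never disagree. The helper name `build_request` and the exact hint sentence are my own assumptions, not anything official; the three sizes are the ones DALL-E 3 accepts.

```python
# Sizes DALL-E 3 accepts, keyed by the orientation word used in the prompt.
SIZES = {
    "portrait": "1024x1792",
    "landscape": "1792x1024",
    "square": "1024x1024",
}


def build_request(prompt: str, orientation: str = "portrait") -> dict:
    """Build keyword arguments for an image request (hypothetical helper).

    Appends an explicit orientation hint to the prompt and picks the
    matching size, so the text and the canvas describe the same shape.
    """
    size = SIZES[orientation]  # raises KeyError for an unknown orientation
    # The hint wording is an assumption; adjust to whatever works for you.
    hint = "" if orientation == "square" else f" The image is in {orientation} orientation."
    return {
        "model": "dall-e-3",
        "prompt": prompt.rstrip() + hint,
        "size": size,
        "n": 1,
    }
```

Assuming the official `openai` Python package, the result could be passed along as `client.images.generate(**build_request("A lighthouse at dusk."))`. This only keeps the prompt and size consistent; it does not guarantee the model draws the scene upright.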
Another prompt attempt you might throw at the problem is “image rotated 90 degrees” with or without the API image dimensions shifted. If the AI “thinks” in landscape, let it make your picture sideways.
Vertical images just don’t work with Copilot Pro. This should be easy to solve. In ChatGPT you can create these now with no problem, but not in Copilot.
Cat image: This image shows a whimsical scene where a cute, fluffy kitten with big eyes is holding a large crayon and standing in front of a colorful array of oversized crayons arranged in a semi-circle. The background depicts a lively urban street with bright lights and buildings, giving the impression of a bustling city. A person, possibly a photographer or filmmaker, is kneeling on the left side, aiming a camera at the kitten, suggesting that this is a photoshoot or filming session. The scene blends elements of animation and live-action, creating a charming and vibrant atmosphere.
My solution is to pass the generated image to a model that accepts images and ask it: “If I see this image in a poster, do I need to rotate my head to see it properly? Reply with yes or no.” If the answer is yes, I retry the generation.
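The check-and-retry idea above could look roughly like this. It is a minimal sketch: `generate` and `ask` are hypothetical callables you would wire up to your own image-generation and vision-model calls, and the attempt limit is an arbitrary choice.

```python
from typing import Callable, Optional

QUESTION = (
    "If I see this image in a poster, do I need to rotate my head "
    "to see it properly? Reply with yes or no."
)


def needs_rotation(answer: str) -> bool:
    """Treat any reply that starts with 'yes' as a sideways image."""
    return answer.strip().lower().startswith("yes")


def generate_upright(
    generate: Callable[[], bytes],
    ask: Callable[[bytes, str], str],
    max_attempts: int = 3,
) -> Optional[bytes]:
    """Regenerate until the vision model says the image is upright.

    generate: hypothetical callable returning image bytes.
    ask: hypothetical callable sending (image, question) to a vision
         model and returning its text reply.
    Returns the first upright image, or None if every attempt was sideways.
    """
    for _ in range(max_attempts):
        image = generate()
        if not needs_rotation(ask(image, QUESTION)):
            return image
    return None
```

Capping the attempts matters because every retry is a paid generation plus a paid vision call, and the check itself can be wrong now and then.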