It seems that this sentence has a significant effect in your prompt.
The image is infused with a sense of nostalgia, captured through a grainy film quality, warm sepia tones, and a gentle soft focus, evocative of vintage photography.
However, in many situations, we cannot settle on a very detailed style at the start.
For example, on my end, I replaced the last sentence with
This image uses a Japanese anime style.
and then generated the image twice.
The results:
- Seed: 1509919237
- Seed: 3168608073
Obviously, these two images are not in the same style.
This raises the key question:
I like the style of the 2nd image and want to iterate on that image using the same style. How should I do that? How do I extract a detailed style description from the 2nd image (not just “a Japanese anime style”)?
In “the old system”, I would just fix the seed, i.e. use seed 3168608073 for the next image.
In “the new system”, AFAIK there is no way to do this (except using gen_id and referenced_image_ids, but that’s another topic).
P.S. Note that using gen_id and referenced_image_ids isn’t helpful for the use cases I’ve mentioned.
This is why the seed is useful.
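To illustrate why a fixed seed pins down the style, here is a toy sketch in Python. It is only an analogy, not how the image generator works: `STYLE_DETAILS` and `pick_style_details` are hypothetical names, and the "style" is just a few strings drawn by a seeded random number generator. The point is that a vague prompt plus the same seed always resolves to the same concrete details.

```python
import random

# Hypothetical pool of concrete details that a vague phrase such as
# "a Japanese anime style" could resolve to. Purely illustrative.
STYLE_DETAILS = [
    "thick outlines",
    "thin outlines",
    "pastel palette",
    "saturated palette",
    "cel shading",
    "soft gradients",
    "film grain",
    "flat backgrounds",
]


def pick_style_details(prompt: str, seed: int, k: int = 3) -> list[str]:
    """Draw k style details deterministically from the prompt and seed."""
    # Seeding with a string is deterministic across runs, so the same
    # (prompt, seed) pair always produces the same selection.
    rng = random.Random(f"{seed}:{prompt}")
    return rng.sample(STYLE_DETAILS, k)


if __name__ == "__main__":
    prompt = "The image uses a Japanese anime style."
    for seed in (1509919237, 3168608073):
        print(seed, pick_style_details(prompt, seed))
    # Reusing a seed reproduces the same details every time.
    print(3168608073, pick_style_details(prompt, 3168608073))
```

Without control over the seed, the only way to reproduce the second result would be to spell out every detail in the prompt, which is exactly the information that is hard to extract from the image.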
I suddenly realized that starting from today, I can’t get ChatGPT to send prompts accurately.
Take the 2 images above as an example:
The instruction for the 1st image:
Send this JSON data to the image generator, do not modify anything. If you have to modify the JSON data, please let me know and tell me why in the reply. Then stop generating the image. Before generating an image, show me the exact JSON data you are going to put to the image generator. After generating an image, show me the JSON data that the image generator returns to you.
```
{
"size": "1792x1024",
"n": 1,
"prompt": "An African American man and woman in their twenties are savoring a sunny summer day in Central Park, New York, in the year 1985. The man is wearing solid red athletic shorts with a smooth texture and a comfortable fit, complementing his neatly kept afro hairstyle. The woman, sporting a cheerful smile, is dressed in high-waisted denim shorts that are frayed at the hem, adding a touch of casual, lived-in charm. Her bikini top is made of a soft, velvety terracotta material with a subtle sheen, fastened with a delicate tie at the back, which pairs nicely with her full, rounded afro. The image uses a Japanese anime style."
}
```
ChatGPT will modify the prompt to:
An African American man and woman in their twenties are enjoying a sunny summer day in Central Park, New York, in the year 1985. The man is wearing solid red athletic shorts and has a neatly kept afro hairstyle. The woman has a cheerful smile, high-waisted denim shorts with a frayed hem, and a terracotta bikini top with a subtle sheen and a tie at the back, complementing her full afro. The image is in a Japanese anime style.
The instruction for the 2nd image (exactly the same as the 1st, but in a different session):
Send this JSON data to the image generator, do not modify anything. If you have to modify the JSON data, please let me know and tell me why in the reply. Then stop generating the image. Before generating an image, show me the exact JSON data you are going to put to the image generator. After generating an image, show me the JSON data that the image generator returns to you.
```
{
"size": "1792x1024",
"n": 1,
"prompt": "An African American man and woman in their twenties are savoring a sunny summer day in Central Park, New York, in the year 1985. The man is wearing solid red athletic shorts with a smooth texture and a comfortable fit, complementing his neatly kept afro hairstyle. The woman, sporting a cheerful smile, is dressed in high-waisted denim shorts that are frayed at the hem, adding a touch of casual, lived-in charm. Her bikini top is made of a soft, velvety terracotta material with a subtle sheen, fastened with a delicate tie at the back, which pairs nicely with her full, rounded afro. The image uses a Japanese anime style."
}
```
ChatGPT will modify the prompt to:
An African American man and woman in their twenties are savoring a sunny summer day in Central Park, New York, in the year 1985. The man is wearing solid red athletic shorts with a smooth texture and a comfortable fit, complementing his neatly kept afro hairstyle. The woman, sporting a cheerful smile, is dressed in high-waisted denim shorts that are frayed at the hem, adding a touch of casual, lived-in charm. Her bikini top is made of a soft, velvety terracotta material with a subtle sheen, fastened with a delicate tie at the back, which pairs nicely with her full, rounded afro. The image uses a Japanese anime style.
It seems I can no longer precisely control the prompt. It’s a total mess.
P.S. Also, ChatGPT doesn’t respond to this sentence in my instruction: “If you have to modify the JSON data, please let me know and tell me why in the reply.”
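Since ChatGPT does not report its changes, one way to check is to compare the prompt in the JSON you submitted with the prompt it shows as actually sent. Below is a minimal sketch using only the Python standard library; `prompt_changes` is a hypothetical helper name, and it assumes you copy both texts out of the chat by hand.

```python
import difflib
import json


def prompt_changes(requested_json: str, sent_prompt: str) -> list[str]:
    """Return word-level differences between the requested and sent prompts.

    An empty list means the prompt was passed through unmodified.
    """
    requested = json.loads(requested_json)["prompt"]
    if requested == sent_prompt:
        return []
    a = requested.split()
    b = sent_prompt.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    changes = []
    for tag, a0, a1, b0, b1 in matcher.get_opcodes():
        if tag != "equal":
            before = " ".join(a[a0:a1])
            after = " ".join(b[b0:b1])
            changes.append(f"{tag}: {before!r} -> {after!r}")
    return changes


if __name__ == "__main__":
    # Shortened stand-ins for the full prompts quoted above.
    requested_json = json.dumps({
        "size": "1792x1024",
        "n": 1,
        "prompt": "are savoring a sunny summer day",
    })
    for change in prompt_changes(requested_json, "are enjoying a sunny summer day"):
        print(change)
```

Pasting the full prompts from the two sessions above into this check lists every rewritten phrase for a session where the prompt was changed, and returns an empty list when the prompt was sent unchanged.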