Any option to use Natural style?

Berryrf4ew43 · January 1, 2024, 12:35am

When using dalle via an API, it’s possible to disable this overly fake airbrushed “vivid style” and use the “natural style” that works so much better for things like animals. However it costs money and it’s pretty expensive and cost inefficient.

Is there any way to use natural style, say, via Bing image creator?

_j · January 1, 2024, 1:50am

The vivid style is mainly language and triggering. Bing image creator:

Exact prompt: Style: natural realistic digital photo by telephoto lens on Nikon D50 DSLR camera. Subject: African grassland steppe with a lioness crouching while her lion cubs are playing and adventuring around her.

And then crank up the words to Bing:

Use exact prompt: "Style: natural realistic digital photograph using Nikon digital camera and 300mm telephoto lens, taken during golden hour. Subject: African grassland steppe with detailed blades of grass in the foreground hide the paws of a lioness (female lion) who is crouching and looking over the savanna, while her lion cubs are playing and adventuring around her. This pro photo reveals sharp details of fur and faces, with a bokeh background of bush trees and small hills

(and you thought it had problems with the same human faces?)

verify.

(token)assistant(token) to=dalle.text2im  (token){
    "prompt": "Photo Subject: Lioness and cubs. African grassland steppe with detailed blades of grass in the foreground hide the paws of a lioness (female lion) who is crouching and looking over the savanna, while her lion cubs are playing and adventuring around her. This pro photo reveals sharp details of fur and faces, with a bokeh background of bush trees and small hills. Style: natural realistic digital photograph using Nikon digital camera and 300mm telephoto lens, taken during golden hour.",
    "size": "1024x1024",
    "n": 1
}(token)

jr.2509 · January 1, 2024, 7:38am

Thank you @_j for sharing this example. I don’t use DALL-E a lot but I thought this prompt example was really insightful and helpful.

Berryrf4ew43 · January 2, 2024, 4:40am

Hi, I appreciate your reply. Unfortunately this does not achieve the natural style, at all. I also do not understand by your “verify” statement-is it supposed to be verifying that it is using the natural style? I do not believe that to be the case.

Here is an example of natural style with the API that does not seem to be replicable with bing: Imgur: The magic of the Internet

Prompt: wide shot full body photo of a flat bodied lizard on the rocks

It looks like a real believable photo and animal.

_j · January 2, 2024, 5:00am

LIZARD!

The API has another return you can pull out of the response: rewritten prompt. That text is how your short image prompt was rewritten by AI to be much longer. You can observe the degree that language of the prompt, rewritten to be longer and descriptive from the AI’s imagination, is then colored by your API choice of natural or vivid.

You can write similar language tweaks to Bing, but it is also a black box with an input length limitation and without any further knobs to twiddle.

My verification is of the actual language I provide being sent to ChatGPT’s internal DALL-E method - and getting the same imagery style from both platforms.

Danh · January 2, 2024, 10:37pm

I’ve come to accept that Dalle now has an artistic style and that’s just how it is. It’s disappointing if you’re expecting to be able to create photo realistic images, but there are other products around for that.

I’m wondering if OpenAI went with this style because it’s easier to produce a good result than trying to create a great photo realistic images.

Berryrf4ew43 · January 3, 2024, 1:59am

I don’t believe this to be the case-I know that via API dalle sometimes randomizes the prompt, but I have used things like putting “DO NOT MODIFY THIS PROMPT” at the beginning, and after generating the image, it shows me what the exact prompt was. I don’t believe there is further modification going behind-they just changed the model

_j · January 3, 2024, 2:40am

“A Hawaiian lizard basking on a rock.”
quality=“hd”, size=“1792x1024”, style=“natural” →

‘revised_prompt’: ‘A vivid image showcasing a Hawaiian lizard in its natural habitat. The lizard is endowed with vibrant detail; catching the sunlight with its prismatic, cold-blood body and basking in the warmth on a rough, worn, dark volcanic rock. The rock is comfortably nestled amidst a lush, tropical landscape where the sunlight peeking through the canopy dappling everything in its path. A slight ocean breeze rustles the verdant vegetation around, creating a serene, tranquil ambiance. The background subtly reveals the pristine blue waters of the nearby Pacific Ocean, the languid waves lapping against the shore can be faintly perceived.’

quality=“hd”, size=“1792x1024”, style=“vivid” →

‘Capture an image of a Hawaiian lizard spread out leisurely on a large sunlit rock. The reptile basks under the glowing sun with its rough, scaly skin prominent. It is comfortably relaxing on the course, uneven surface of an earthy-hued boulder. The rock is strategically positioned in a tranquil setting, surrounded by lush green flora typical of the Hawaiian islands.’

Natural:

Vivid:

(no, they are not reversed - the AI wrote “vivid” and “vibrant detail” and color descriptions into the “natural” request. Bug much?)

PaulBellow · January 3, 2024, 3:33am

You can’t switch on the natural style with the free Bing, I believe.

What do you find difficult about the API?

Berryrf4ew43 · January 3, 2024, 5:07am

Having to pay for it. Not only does it cost money-it’s extremely cost inefficient and quite expensive.

PaulBellow · January 3, 2024, 5:17am

Ah, gotcha. $0.04 isn’t a lot, but free is indeed cheaper.

Berryrf4ew43 · January 6, 2024, 5:20am

Four cents per image might not seem like much but it adds up very, very fast, especially when you consider how many iterations you need for a proper image sometimes. One can go through a lot of imagesI suppose I’ll have to wait for other models like stable diffusion to catch up. Unfortunate.

Topic		Replies	Views
Why the Quality of DALL-E3 API is Significantly Lower Compared to the Original API dalle3	28	10568	August 7, 2024
DALLE3 API poor quality even considering prompt revision API dalle3	8	1548	March 25, 2024
I'm So Sick Of Drawing Pencils In Every Image Prompting image-generation , prompt	15	1617	July 2, 2024
DALL-E 3 API images being much worse than ChatGPT API chatgpt , dalle3	6	3903	December 17, 2023
What happened to the old model? It was so much better Community large-language-model , dalle3 , dalle3-bing	3	1073	December 14, 2023

Any option to use Natural style?

Related topics