Diversity in random images

Just noticed that 95% of the time I ask for a random image of something like a character or people.
Unless specified all of the images are of Caucasian people. Never does it create a image of a ethnic person by default. Only time it happens is if it requested.
Bad look

Hi @jolewilliampeterson and welcome to the community!

This is a very general issue across all GenAI models, not just Dall-E 3. The underlying misrepresentation in the internet data is basically working against us. It’s not just ethnicity/colour, but gender and lots of other cultural factors that are extremely asymmetric.

Turns out it’s a very difficult problem to solve, as Google found out earlier in the year.

1 Like

Yea, I get it. It’s just a shame that still in this world of tech and advancement we still stuffer from these things.
It’s just so off putting. There are more ethnic people in the world and yet if i ask for an image of someone eating a pizza, its 100% gonna be a White male.

thanks for the welcome to the community

1 Like

Gender bias is common also, a nurse is a woman, a firefighter a man. But it has gotten better. Bias is one of the things this tech will eventually cure imo. On the simple fact you can’t have a scientific tool that has built in bias. :rabbit:it is in the data and there is a lot of data to bootstrap

IE, Data Firefighters if mainly male: If your dataset is heavily skewed toward males (e.g., in a fire department where most workers are male), and you bootstrap from this, the resampled data will still predominantly include males. This is because bootstrapping draws with replacement from the original sample, which preserves its characteristics.

Bias through balanced data: If, instead, your dataset has an equal mix of males and females (or other groups of interest), then bootstrapping from this more representative dataset would better reflect diversity and reduce the bias toward over-represented groups.

Bias is one of the things I work with in my models.

1 Like