Giving images to DALL-E is not great

I have used Midjourney, SDXL, and other models, but none of them handles using images as input for new images as badly as this. I guess it describes the image using CLIP or something similar instead of feeding in the image data itself. In any other model you normally get a fairly accurate, similar image, but here you don't, which makes it hard to mix and blend images while trying to get to something.