Thanks man. Forums like this one provided by OpenAI, as well as platforms like Reddit, frequently serve as the breeding grounds for new discoveries. On social platforms, influencers often disseminate these ideas, presenting them as if they were their own. Consequently, their followers end up relying solely on the influencer. That’s my primary concern.
I think the API format is beneficial because it’s what ChatGPT would naturally generate on its own. However, what seems even more crucial is to ask ChatGPT to review the guidelines and to respect the directive of not changing the prompt, provided the guidelines are met, before passing it on to DALL-E.
When ChatGPT is confident that it’s not breaching any rules, its secondary objective becomes user satisfaction. Perhaps this is why the approach is effective.
Using seeds brings consistency to similar images. To achieve this, one can craft a near-identical prompt and use the same seed, making DALL-E’s image creation predictable. Since DALL-E produces the image, this approach works. On the other hand, when an image is fed into GPT-V, it’s transformed into a numerical format, which the model later deciphers. This isn’t the same as having all the components for an exact reproduction.
Nevertheless, if the aim is merely to use the image as a reference, OpenAI’s new update allows users to seamlessly switch between GPT-V and DALL-E within the same interface. But I haven’t had the opportunity to test it out yet.