Hi,
We are developing an app that allows users to input a story, such as “Wonderland – Down the Rabbit Hole.” Our application will first split the whole story into chunks based on the scenes, potentially using some OpenAI API for this chunking.
Then, we want to use the DALL-E API to generate images for each scene prompt .
-
Currently, we are manually chunking the story and generating an image for each sentence, representing individual scenes. We want OpenAI to handle the chunking for us. I believe using prompt engineering could be a good choice for this. What would be a suitable prompt for this? Any other suggestions would be appreciated as well.
-
The images generated from individual independent prompts are not coherent and consistent according to the theme of the story. For instance, the character (who should remain the same throughout the story) appears completely different in each scene in terms of costume and physical appearance. Additionally, the background environment changes with every prompt. We need consistency throughout the story maintaining the context.
How can we achieve this consistency?
Thanks for your time and consideration.