Feedback on Sora image generation for educational/toddler diagrams

I’ve been testing Sora and wanted to share some feedback after trying to create a bedtime routine diagram in Spanish for my toddler.

What works well:

  • I really appreciate that Sora offers multiple image options (usually four) with each prompt. That gave me a better chance of finding something usable, even when the results weren’t perfect. This feature is helpful and sets it apart from other tools like ChatGPT.

What needs improvement:

  1. Context memory: Sora doesn’t seem to retain context from previous prompts, making it hard to refine or revise an image based on earlier results.
  2. Structure consistency: Steps were often cut off, skipped, combined, or repeated, which made it unreliable for visual sequences.
  3. Mismatched visuals: The illustrations didn’t always align with the text. For example, brushing teeth might be accompanied by an image of a child clasping their hands together with no toothbrush in sight, or brushing teeth while also reading a book.
  4. Grammar issues: Spanish phrases were sometimes nonsensical — for example, “elegir los dientes” (choose the teeth) instead of “cepillar los dientes” (brush your teeth), or “decir dos libros” (say two books) instead of “leer dos libros” (read two books).

It would be great if future updates included:

  • Better context retention for iterative image editing.
  • More control to fix or regenerate individual parts of an image.
  • Improved language accuracy for common educational phrases.
  • Tighter alignment between text instructions and visuals.

Here is an example of the results for what should have been a ten-step bedtime routine; notice only eight steps are included.

Thanks for your work — there’s a lot of potential here, and I hope future versions will make it more reliable for creating helpful, multilingual visual aids for young kids.

Hi, welcome to the community!

As you know,

In Sora, Image generation can only make videos or pictures right now. It doesn’t have a chat feature like ChatGPT, so we can’t continue an image story by talking to it.

We can upload an image to change it, but it usually changes a lot and the original characters are lost.

Using 10 panels is too much because Image generation can’t really control more panels, objects, or the text in more grids properly, whether it’s in Spanish or English. It works better with 4, 5, or 6 panels. Sometimes it works, but prompts are not reusable.

Prompt

A horizontal 1536x1024 aspect ratio infographic for toddlers showing a 5-step vertical panels bedtime routine in Spanish. Each step includes a cartoon-style image of a young child, a symbolic icon, and a Spanish label in bold, rounded font. Steps: 1. Bañarse, 2. Ponerse el pijama, 3. Cepillar los dientes, 4. Leer dos libros, 5. Apagar la luz. Style: soft colors, white background, and clean layout for young children.

Summary

A horizontal 1536x1024 aspect ratio infographic for toddlers showing a 4-step bedtime routine in Spanish. First line of grids should have 2 and second line grids should have 2 panels. Each step includes a cartoon-style image of a young child, a symbolic icon (e.g., book, toothbrush, moon), and a Spanish label in bold, rounded font. Steps: 1. Bañarse, 2. Ponerse el pijama, 3. Cepillar los dientes, 4. Leer dos libros. Style: soft colors, white background, and clean layout for young children.

Prompt

A horizontal 1536x1024 aspect ratio infographic for toddlers showing a 4-step bedtime routine in Spanish. First line of grids should have 2 and second line grids should have 2 panels. Each step includes a cartoon-style image of a young child, a symbolic icon (e.g., book, toothbrush, moon), and a Spanish label in bold, rounded font. Steps: 1. Apagar la luz, 2. Acostarse, 3. Abrazar el peluche, 4. Cerrar los ojos. Style: soft colors, white background, and clean layout for young children.

prompt

A horizontal 1536x1024 aspect ratio infographic for toddlers showing a 10-step bedtime routine in Spanish. First line of grids should have 5 and second line grids should have 5 panels. Each step includes a cartoon-style image of a young child, a symbolic icon (e.g., book, toothbrush, moon), and a Spanish label in bold, rounded font. Steps: 1. Bañarse, 2. Ponerse el pijama, 3. Cepillar los dientes, 4. Leer dos libros, 5. Apagar la luz, 6. Acostarse, 7. Abrazar el peluche, 8. Cerrar los ojos, 9. Respirar profundo, 10. Dormir. Style: soft colors, white background, and clean layout for young children.

Prompt

A horizontal 1536x1024 aspect ratio infographic for toddlers showing a 10-step bedtime routine in English. The first row should have 5 panels, and the second row should have 5 panels. Each step includes a cartoon-style image of a young child, a symbolic icon (e.g., book, toothbrush, moon), and an English label in bold, rounded font. Steps: 1. Take a bath, 2. Put on pajamas, 3. Brush teeth, 4. Read two books, 5. Turn off the light, 6. Lie down, 7. Hug the stuffed animal, 8. Close your eyes, 9. Take a deep breath, 10. Sleep. Style: soft colors, white background, and clean layout for young children.

It did not draw grid 4 and 10