Through through the API, sora-2 can only generate 4, 8, or 12 second videos. They “cut off” unexpectedly, like mid-sentence the video will just end. If there is a solution for that, I would love to hear it.
I tried to switch to sora-2-pro, but it doesn’t seem to allow longer videos, I get constrained to the same 4, 8, or 12 seconds, attempting the 10, 15, or 25 second videos just gets rejected. Am I missing something?
I suspect the length of the output, and fitting content within, is just something you’ll need to prompt well - without being given a prompting guide by OpenAI.
Generative AI produces sequences, and like “chat”, might not know when you have forced a maximum output cutoff point, and the model might have training on longer outputs than can be delivered by API.
The AI making noise is mostly a side-effect, rarely useful except for novelty. A speaking video might get the mouth moving for you, for an ADR voice actor to make different generations consistent.