Will Sora support conditioning on images or videos?

I was wondering whether Sora can be conditioned on something other than text prompts, so that it could create videos from images or extend existing videos, or whether it will be prompt-only. Any ideas?

All the publicly available information is here:

In addition, there is a first impression of Sora on OpenAI’s blog.