Abstract: Generating long-form videos conditioned on large, story-based text input is a new and relatively unexplored task. Current text-to-video models are designed to generate short video clips ...