Sora is OpenAI's text-to-video AI model that generates high-quality videos up to one minute long from text prompts. It combines diffusion and transformer architectures, processing text input through spatial-temporal patches and iteratively refining random noise into coherent video sequences. While it can create complex scenes
Table of contents
What is text-to-video AI?What is Sora?How does it work?What does Sora still need to improve?Conclusion1 Comment
Sort: