Lumiere is a text-to-video diffusion model that can synthesize realistic and coherent videos. It uses a Space-Time U-Net architecture and diffusion probabilistic models for video generation. It has applications in stylized generation and conditional generation. Lumiere outperforms competitors in terms of motion magnitude,

2m read timeFrom blog.gopenai.com
Post cover image
Table of contents
Paper Review: Lumiere: A Space-Time Diffusion Model for Video GenerationThe approachApplicationsEvaluations

Sort: