A comprehensive walkthrough of how diffusion models work, covering four main areas: training objectives (ELBO, denoising matching, score-based interpretation), conditional generation via classifier guidance and classifier-free guidance, high-resolution image synthesis (cascaded diffusion, latent diffusion models), and sampling acceleration techniques (DDIM, progressive distillation, consistency models, LoRA). The content derives the math behind forward/reverse diffusion processes, explains three equivalent training interpretations (predicting clean image, predicting noise, predicting score), and surveys practical methods to make diffusion models faster and more controllable.

17m watch time

Sort: