HART, a hybrid autoregressive transformer, combines the benefits of autoregressive and diffusion models to generate high-quality images about nine times faster and with fewer computational resources. This approach involves an autoregressive model capturing the big picture quickly and a small diffusion model refining the details, making it suitable for applications like training self-driving cars and creating video game scenes.

6m read timeFrom news.mit.edu
Post cover image

Sort: