Research from Carnegie Mellon University demonstrates that diffusion models outperform autoregressive models in data-constrained scenarios. While autoregressive models excel with limited compute, diffusion models show superior performance when data is the bottleneck, exhibiting better resistance to overfitting and ability to
Sort: