Chinese text-to-image diffusion model ERNIE-ViLG got upgraded to version 2.0. It can scale up the model to 24 billion parameters, which is 10 times more than in Stable Diffusion.
Sort: