Japanese Stable Diffusion is a language-specific text-to-image model. It was trained mainly on the English subset of LAION-5B and can generate high-performance images simply by entering text prompts. The code is based on Hugging Face and GitHub. In the 1st stage, the latent diffusion model is fixed and we replaced the English text encoder with a Japanese-specific message encoder. After this, the model is replaced with aJapanese-specific language encoder, which is trained.

6m read timeFrom huggingface.co
Post cover image
Table of contents
Stable DiffusionJapanese Stable Diffusionrinna’s Open StrategyWhat’s Next?

Sort: