A lightweight character-level diffusion model for text generation with 10.7 million parameters, trained on the Tiny Shakespeare dataset. The implementation modifies the nanochat GPT architecture and includes pre-trained weights, training scripts, text generation capabilities, and visualization tools for the diffusion denoising