A lightweight character-level diffusion model for text generation with 10.7 million parameters, trained on the Tiny Shakespeare dataset. The implementation modifies nanochat GPT architecture and includes pre-trained weights, training scripts, text generation capabilities, and visualization tools for the diffusion denoising

2m read timeFrom github.com
Post cover image
Table of contents
InstallationQuick StartDefault ConfigFile Structure

Sort: