A complete transformer-based Large Language Model implementation built from scratch in pure Rust using only ndarray for matrix operations. The project includes pre-training on factual text, instruction tuning for conversational AI, interactive chat mode, and full backpropagation with gradient clipping. Features a modular
Table of contents
๐ What This Is๐ Key Files to Explore๐๏ธ Architecture๐งช What The Model Learns๐ Quick Start๐ฎ Interactive Mode๐งฎ Technical Implementation๐ง Development๐ง Learning Resources๐ Dependencies๐ค Contributing1 Comment
Sort: