Andrej Karpathy released AutoResearch, a 630-line Python script that autonomously ran 50 ML experiments overnight on a single GPU without human input. The core design rests on three primitives: a single editable asset (the training script), a scalar metric (validation bits per byte), and a time-boxed evaluation cycle. A key

10m read timeFrom thenewstack.io
Post cover image
Table of contents
What is the Karpathy Loop?Markdown is the human-agent interfaceThe Pattern That Goes Beyond ML ExperimentsThe human role shifts to experimental designWhat’s next

Sort: