The Weirdly Small AI That Cracks Reasoning Puzzles [HRM]
This title could be clearer and more informative.Try out Clickbait Shieldfor free (5 uses left this month).
A hierarchical reasoning model (HRM) with only 27 million parameters outperforms large language models like DeepSeek R1 and GPT-o3 on inductive reasoning benchmarks including Sudoku and maze navigation. The model uses a two-level recurrent architecture: low-level modules handle fast, detailed computations while high-level modules build abstract representations over longer time horizons. Key techniques include input injection to maintain problem context, fixed-point gradient approximation to reduce memory usage, deep supervision across segments, and adaptive computation that stops recurrence early when a solution is found. Remarkably, HRM achieves near-perfect accuracy on challenging Sudoku problems after training on just 1,000 examples.
Sort: