The End of Frozen LLMs? (Google’s Hope Explained)
This title could be clearer and more informative.Try out Clickbait Shieldfor free (5 uses left this month).
Google has published a paper introducing a new model architecture called Hope, built on a paradigm called Nested Learning. The core problem it addresses is that current LLMs are 'frozen in time' after training and suffer from catastrophic forgetting when updated. Inspired by human neuroplasticity and brain oscillations, Hope uses Neural Learning Modules (NLMs) that each have their own learning objective, learning rate, and update frequency. These modules are assembled into a Continuum Memory System where different components update at vastly different timescales — from every 16 tokens to every 16 million tokens — creating a hierarchy of learning that mitigates catastrophic forgetting. A key component called Self-Modifying Titans replaces the attention mechanism, using an associative memory that updates its own weights in real-time via local mini-backpropagations. Benchmark results show Hope outperforms Transformer baselines on language modeling, common-sense reasoning, and continual learning tasks, while avoiding the quadratic cost of attention for long sequences.
•15m watch time
1 Comment
Sort: