Can LLMs Be Computers?

Researchers at Percepta demonstrate that a transformer can act as a full computer by implementing a WebAssembly interpreter inside transformer weights, enabling arbitrary C programs to execute for millions of steps entirely within the model's inference loop — no external tools required. The key innovation is restricting attention head dimensions to 2D, which reframes attention lookups as convex-hull queries solvable in logarithmic time rather than linear scans over the full KV cache. This 'Exponentially Fast Attention' path reduces per-step decoding cost from O(n) to O(log n), making long execution traces practical. Demos include solving the world's hardest Sudoku and running the Hungarian algorithm for min-cost matching at over 30k tokens/sec on CPU. Future directions include hybrid fast/slow architectures, compiling programs directly into weights, and growing AI systems incrementally like software libraries.
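To make the convex-hull idea concrete, here is a minimal sketch of why 2D keys allow a logarithmic-time lookup. It is not Percepta's implementation. It assumes hard-max attention (return the single key with the largest dot product), a static set of keys, and a query whose second component is positive. The function names `upper_hull` and `argmax_dot` are hypothetical. The best-scoring key always lies on the convex hull, and scores rise and then fall along the upper hull, so a binary search over hull edges finds the maximum.

```python
def cross(o, a, b):
    """2D cross product of vectors o->a and o->b."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def upper_hull(keys):
    """Upper convex hull of 2D keys, ordered left to right (monotone chain)."""
    hull = []
    for p in sorted(set(keys)):
        # Drop the last vertex while it is not a strict clockwise turn.
        while len(hull) >= 2 and cross(hull[-2], hull[-1], p) >= 0:
            hull.pop()
        hull.append(p)
    return hull


def argmax_dot(hull, q):
    """Return the hull vertex maximizing q . k using O(log h) comparisons.

    Assumption for this sketch: q[1] > 0, so the maximizer is on the upper hull.
    """
    if not hull:
        raise ValueError("hull must contain at least one key")
    if q[1] <= 0:
        raise ValueError("this sketch only handles queries with q[1] > 0")
    lo, hi = 0, len(hull) - 1
    while lo < hi:
        mid = (lo + hi) // 2
        dx = hull[mid + 1][0] - hull[mid][0]
        dy = hull[mid + 1][1] - hull[mid][1]
        if q[0] * dx + q[1] * dy > 0:
            lo = mid + 1  # score still increasing along this edge
        else:
            hi = mid  # score has peaked at or before mid
    return hull[lo]


keys = [(0, 0), (1, 3), (2, 4), (3, 3), (4, 0), (2, 1)]
hull = upper_hull(keys)
best = argmax_dot(hull, (1, 2))
```

A linear scan over `keys` gives the same answer as `argmax_dot`, but touches every key. The hull here is built once up front. Supporting a KV cache that grows during decoding would need an incremental hull structure, which this sketch does not attempt.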

#llm

#transformers

Mar 13 • 22m read time • From percepta.ai

Table of contents

TL;DR
Motivation: LLMs cannot compute
How we turned LLMs to computers
What does computation mean?
More demos: Sudoku
How can computation be encoded?
The key unlock: Exponentially Fast Attention
So what is next?
Closing thoughts
