ARC-AGI-3 is a new interactive reasoning benchmark designed to measure human-like intelligence in AI agents. Unlike static puzzle benchmarks, it requires agents to explore novel environments, acquire goals dynamically, build world models, and learn continuously from experience without natural-language instructions.

From arcprize.org
