ARC-AGI-3 is a new interactive reasoning benchmark designed to measure human-like intelligence in AI agents. Unlike static puzzle benchmarks, it requires agents to explore novel environments, acquire goals dynamically, build world models, and learn continuously from experience without natural-language instructions. It measures
Table of contents
What is ARC-AGI-3?Sort: