AI systems are rapidly advancing in mathematical reasoning, with top models like ChatGPT 5.2 Pro and Claude Opus 4.6 now solving over 40% of Frontier Math's hardest problems—up from just 2% when the benchmark launched in late 2024. Google DeepMind's Aletheia achieved autonomous, publishable PhD-level math results. New

5m read time From spectrum.ieee.org
Post cover image
Table of contents
AI takes on PhD level mathematicsThe First Proof challengeA new frontier for AI

Sort: