Advent of Code (AoC) is an annual coding competition. This post explores how OpenAI's o1-mini model performs remarkably well on AoC puzzles, solving 86% of the puzzles correctly and quickly. Using a robust test harness and standard prompts, the model succeeded where previous generations struggled. However, there's controversy

6m read timeFrom blog.scottlogic.com
Post cover image
Table of contents
The test harnessJust how good is o1-mini?Failure modesClosing thoughts
2 Comments

Sort: