Can you reverse engineer our neural network?

Jane Street published a CTF-style ML puzzle where solvers were given a complete neural network (weights included) and had to reverse engineer what it computes using mechanistic interpretability techniques. The post walks through one solver's journey: discovering the network implements MD5 hashing via hand-crafted integer

14m read time. From blog.janestreet.com
Table of contents
The problem
A solution
Another puzzle
