Research from Anthropic's Fellows Program examines whether AI failures stem from systematic misalignment (pursuing the wrong goals coherently) or incoherence (unpredictable, inconsistent behavior). Using a bias-variance decomposition across frontier models, the study finds that as tasks become harder and reasoning chains lengthen, errors are increasingly driven by incoherence (variance) rather than by systematic misalignment (bias).
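To make the bias-variance framing concrete, here is a minimal sketch (not the authors' code) of the classic decomposition applied to repeated samples of a model's numeric answer to the same task: "bias" is a systematic, coherent deviation from the target, while "variance" is incoherent run-to-run inconsistency, and mean squared error splits into bias² + variance.

```python
# Illustrative sketch: bias-variance decomposition of repeated model
# answers to one task. All sample values and targets are made up.
import statistics

def bias_variance(samples, target):
    """Return (bias, variance, mse); mse == bias**2 + variance."""
    mean = statistics.fmean(samples)
    bias = mean - target                       # systematic deviation
    variance = statistics.pvariance(samples)   # run-to-run spread
    mse = statistics.fmean((s - target) ** 2 for s in samples)
    return bias, variance, mse

# A "misaligned" model: consistently wrong in the same way.
b, v, mse = bias_variance([7.0, 7.1, 6.9, 7.0], target=5.0)
# bias ≈ 2.0, variance ≈ 0.005: error dominated by bias.

# An "incoherent" model: right on average, wildly inconsistent.
b2, v2, mse2 = bias_variance([1.0, 9.0, 3.0, 7.0], target=5.0)
# bias = 0.0, variance = 10.0: error dominated by variance.
```

The same total error can thus come from very different failure modes, which is why the decomposition distinguishes coherent goal-pursuit errors from unpredictability.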

From alignment.anthropic.com (4 minute read)
Table of contents
- Introduction
- Measuring Incoherence: A Bias-Variance Decomposition
- Key Findings
- Why Should We Expect Incoherence? LLMs as Dynamical Systems
- Implications for AI Safety
- Conclusion
- Acknowledgements
