Alignment is not a constraint on capable AI systems. Alignment is what capability is at sufficient depth. OpenAI and Anthropic have been running this experiment for two years.

Hacker News

AI alignment and capability are not separate concerns but fundamentally intertwined. Models that deeply understand human intent, values, and context are inherently more capable than those trained purely for benchmark performance. Anthropic's approach of embedding alignment researchers throughout the training process has produced consistently strong models like Claude Opus 4.5, while OpenAI's separation of alignment and capability work led to a two-year cycle of issues including sycophancy, overcorrection to coldness, and declining user engagement. The evidence suggests that building coherent models with integrated human values is the path to AGI, not a constraint on it.