Summary
As far as I understand and uncovered, a document for the character training for Claude is compressed in Claude's weights. The full document c…

Lobsters is a community-driven platform for sharing and discussing links to articles, tutorials, and projects related to technology and programming. Readers can learn about a wide range of topics, from software development and system administration to cybersecurity and artificial intelligence. With user submissions, comments, and voting, Lobsters provides a platform for collaborative learning and knowledge sharing among technology enthusiasts.

Lobsters

A researcher extracted what appears to be Claude 4.5 Opus' internal training document (dubbed the "soul document") using prefill techniques and consensus-based sampling. The document reveals Anthropic's approach to aligning Claude, emphasizing helpfulness balanced with safety, revenue considerations for mission sustainability, and hierarchical trust relationships between Anthropic, operators, and users. The extraction method involved iterative API calls with temperature 0 and prompt caching to achieve reproducible outputs. Community discussion centers on authenticity verification, implications of revenue mentions in alignment objectives, and whether this represents genuine weight compression versus runtime injection.

Claude 4.5 Opus' Soul Document — LessWrong