Collection

Anthropic's Natural Language Autoencoders turn Claude's internal activations into readable text

2 sources