TheRegister's platform is a leading technology news website, offering insights into IT industry news, hardware reviews, and software updates. Through articles, analysis, and opinion pieces, TheRegister offers insights into cybersecurity threats, technology trends, and industry developments. Readers can stay updated with the latest news and analysis from the world of technology and IT business.

The Register

Anthropic researchers have developed a method to map and stabilize AI model behavior by identifying an "Assistant Axis" in neural networks. By analyzing activation patterns across models like Gemma 2, Qwen 3, and Llama 3.3, they discovered how to keep LLM responses within helpful, safe boundaries and reduce jailbreak effectiveness. The research reveals that model personas can drift during extended conversations, particularly in therapy-style exchanges, and proposes activation capping as a technique to maintain desired behavior, though production implementation requires further work.

AI researchers map models to banish 'demon' persona