Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

DeepSeek-V4-Flash, a local model competitive with low-end frontier models for agentic coding, makes LLM activation steering practically accessible for the first time. The post explains how steering works — extracting concept vectors from model activations and boosting them during inference — and explores why it hasn't been widely adopted: big labs don't need it, API users can't access weights, and basic use cases are outcompeted by prompting. The author is cautiously skeptical but intrigued by potential applications like steering for 'unpromptable' concepts or compressing large context into implicit memory. The open-source project DwarfStar 4 by antirez is highlighted as an early example of steering built into a local model runner.

DeepSeek-V4-Flash means LLM steering is interesting again