Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

Stanford researchers introduce HipKittens, a C++ programming framework for writing high-performance AMD GPU kernels that matches or exceeds AMD's hand-optimized assembly implementations. The framework uses tile-based abstractions that generalize across GPU architectures, achieving state-of-the-art performance on attention mechanisms, GEMM operations, and other AI workloads with significantly less code (~500 lines vs raw assembly). The work addresses the software gap preventing AMD GPUs from competing with NVIDIA in AI workloads, demonstrating that peak AMD performance is achievable without raw assembly programming.

HipKittens: Fast and Furious AMD Kernels

Building towards multi-silicon AI systems

Climbing out of the CUDA moat: Introducing HipKittens