GPU code can now use Rust's threads. We share the implementation approach and what this unlocks for GPU programming.

Hacker News is a community-driven platform for sharing and discussing technology news, startups, and programming-related topics. Through user submissions and comments, Hacker News offers insights into emerging technology trends, industry developments, and entrepreneurial ventures. Readers can participate in discussions, share their insights, and stay informed about the latest advancements in technology and innovation.

Hacker News

VectorWare demonstrates running Rust's std::thread on the GPU for the first time, mapping each std::thread to a GPU warp. The approach starts with only Warp 0 active (running main), with additional warps woken on thread::spawn() and blocked on thread::join(). This preserves Rust's borrow checker semantics, prevents warp divergence by construction, and unlocks large portions of the Rust ecosystem (rayon, tokio, etc.) for GPU use. The post covers the implementation details, benefits (no divergence, familiar Rust abstractions), and downsides (finite warps, expensive synchronization, memory constraints). The approach targets NVIDIA GPUs but is portable to Vulkan subgroups and HIP/ROCm wavefronts.

Rust threads on the GPU