Nvidia's new Vera Rubin architecture, announced at CES 2026, promises 10x lower inference costs and 4x fewer GPUs for training compared to Blackwell. While the Rubin GPU delivers 50 petaflops of 4-bit computation, the real innovation lies in six new chips working together through extreme co-design. The NVLink6 switch doubles

From spectrum.ieee.org
Table of contents
Expanded “in-network compute”
Scaling out and across
