Burn 0.20 introduces CubeK, a high-performance multi-platform kernel system built on CubeCL that enables unified CPU and GPU execution across NVIDIA CUDA, AMD ROCm, Apple Metal, WebGPU, and Vulkan. The release aims to deliver peak performance on diverse hardware without requiring separate, fragmented codebases for each platform, and its benchmarks show significantly lower execution times than LibTorch and ndarray. The update also includes a complete overhaul of the ONNX import system and various stability improvements.