cuTile.jl is a new Julia package that brings NVIDIA CUDA Tile-based GPU programming to Julia, mirroring the cuTile Python API while using idiomatic Julia syntax. It abstracts away thread/warp management, letting developers write kernels at the tile level while the compiler handles hardware mapping including tensor cores.

6m read timeFrom developer.nvidia.com
Post cover image
Table of contents
What is tile-based GPU programming?Idiomatic Julia kernelsPerformance of cuTile.jlHow cuTile.jl worksCurrent status of cuTile.jlGetting started

Sort: