cuTile.jl is a new Julia package that brings NVIDIA's tile-based CUDA programming model (CUDA Tile) to Julia. It mirrors the cuTile Python API but uses idiomatic Julia syntax. Developers write kernels at the tile level instead of managing threads and warps by hand, and the compiler handles the mapping to hardware, including tensor cores.
Table of contents
What is tile-based GPU programming?
Idiomatic Julia kernels
Performance of cuTile.jl
How cuTile.jl works
Current status of cuTile.jl
Getting started