cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia

Originally published at: cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia | NVIDIA Technical Blog

NVIDIA CUDA Tile is one of the most significant additions to NVIDIA CUDA programming and unlocks automatic access to tensor cores and other specialized hardware. Earlier this year, NVIDIA released cuTile for Python, giving Python developers a natural way to write high-performance GPU kernels.  Now, the same programming model is available in Julia through cuTile.jl.…