Advanced API Performance: Intrinsics

Originally published at: https://developer.nvidia.com/blog/advanced-api-performance-intrinsics/

Intrinsics can be thought of as higher-level abstractions of specific hardware instructions. They offer direct access to low-level operations or hardware-specific features, enabling increased performance. In this way, operations can be performed across threads within a warp, also known as a wavefront.

Recommended

Wave intrinsics can noticeably speed up your shaders. Many sorting or reduction algorithms…
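To make the idea of a cross-thread operation within a warp more concrete, here is a minimal CPU-side sketch in Python. It is an emulation for illustration only, not shader code and not how any particular GPU implements the operation. It models a wave as a list of per-lane values and sums them with a butterfly exchange, similar in spirit to what HLSL's `WaveActiveSum` returns or to a reduction built on CUDA's `__shfl_xor_sync`. The names `WAVE_SIZE`, `shuffle_xor`, and `wave_active_sum` are hypothetical, and the wave width of 32 is an assumption.

```python
WAVE_SIZE = 32  # assumed wave width; real hardware may use a different lane count


def shuffle_xor(lanes, mask):
    """Emulate a cross-lane exchange: lane i reads the value held by lane i ^ mask."""
    return [lanes[i ^ mask] for i in range(len(lanes))]


def wave_active_sum(lanes):
    """Butterfly reduction: after log2(n) exchange steps every lane holds the total.

    No atomics or shared scratch buffer are involved; each step only reads
    a partner lane's current value and adds it to the lane's own value.
    """
    n = len(lanes)
    assert n > 0 and n & (n - 1) == 0, "lane count must be a power of two"
    values = list(lanes)
    offset = n // 2
    while offset >= 1:
        partner = shuffle_xor(values, offset)
        values = [a + b for a, b in zip(values, partner)]
        offset //= 2
    return values


# Hypothetical input: each lane contributes its own lane index.
per_lane = list(range(WAVE_SIZE))
result = wave_active_sum(per_lane)
```

In this sketch the whole reduction takes five exchange steps for 32 lanes, and every lane ends up holding the same total, which is the property that lets a shader avoid a separate atomic add or a barrier-protected shared-memory pass for the per-wave part of a reduction.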