How to Implement Performance Metrics in CUDA C/C++

Originally published at: https://developer.nvidia.com/blog/how-implement-performance-metrics-cuda-cc/

In the first post of this series we looked at the basic elements of CUDA C/C++ by examining a CUDA C/C++ implementation of SAXPY. In this second post we discuss how to analyze the performance of this and other CUDA C/C++ codes. We will rely on these performance measurement techniques in future posts where performance optimization…