NVIDIA Tools Extension API (NVTX): Annotation Tool for Profiling Code in Python and C/C++

Originally published at: NVIDIA Tools Extension API (NVTX): Annotation Tool for Profiling Code in Python and C/C++ | NVIDIA Developer Blog

As PyData leverages much of the static language world for speed including CUDA, we need tools which not only profile and measure across languages but also devices, CPU, and GPU.  While there are many great profiling tools within the Python ecosystem: line-profilers like cProfile and profilers which can observe code execution in C-extensions like PySpy/Viztracer. …