I was wondering how to profile deep learning model across multiple nodes (mpi machines). Simply using “nsys profile …” doesn’t work.
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Nsys for multi GPU apps | 1 | 1354 | September 10, 2018 | |
MPS capability for nsight products | 0 | 617 | November 4, 2020 | |
I want to profile multiprocess at once | 1 | 833 | January 7, 2022 | |
Support for MPS | 7 | 1027 | December 20, 2021 | |
Question about Nsight Compute's application range replay support data collection for multi-node, multi-GPU setups under NCCL | 3 | 475 | March 14, 2024 | |
DLI Course: Optimizing CUDA Machine Learning Codes With Nsight Profiling Tools | 0 | 298 | July 12, 2022 | |
DLI Course: Optimizing CUDA Machine Learning Codes With Nsight Profiling Tools | 0 | 313 | July 12, 2022 | |
ONNX model profiling using Nsight tools | 2 | 835 | November 28, 2023 | |
Can I use Nsight system on Apple silicon? | 1 | 585 | September 11, 2023 | |
[problem] Nsight System cannot collect program performance data in a multi-node distributed environment | 4 | 831 | April 20, 2023 |