Nsight system showing wrong behaviour

manver · June 3, 2024, 4:02pm

Hi,
I’m profilling my code using nsight systems with cuda 11.3 .
Here is the code i"m using for profilling ::

int main(int argc, char **argv)
{
    // Profilling Tags
    nvtxMarkA("XX");
    nvtxRangePushA("XX");

    // Initializing the Python
    Py_Initialize();

    // Importing the Os and appending Path in python
    PyRun_SimpleString("import sys\n"
                       "import os\n"
                       "sys.path.append(os.getcwd())\n");

    // Setting the precision and Type of Program
    Set_precision();

    // Setting the communications
    set_communications();

    // Setting the signal for interupt
    signal(SIGINT, signal_callback_handler);

    if (HYDRO_FLAG)
    {

        if (!precision.compare("single"))
        {
            Hydro<T1_f, T2_f>();
        }

        if (!precision.compare("double"))
        {
            Hydro<T1_d, T2_d>();
        }
    }

    if (SCALAR_FLAG || RBC_FLAG)
    {

        if (!precision.compare("single"))
        {
            Scalar<T1_f, T2_f>();
        }

        if (!precision.compare("double"))
        {
            Scalar<T1_d, T2_d>();
        }
    }

    if (MHD_FLAG)
    {
        if (!precision.compare("single"))
        {
            MHD<T1_f, T2_f>();
        }

        if (!precision.compare("double"))
        {
            MHD<T1_d, T2_d>();
        }
    }

    // Python Finalize
    Py_Finalize();
    nvtxRangePop();
    
    return 0;
}

But somehow when i’m opening the profilling file . Instead of 1 it is showing 3 XX runs . WHY ?? Can anyone help

Here , in the screenshot also you can see 3 calls . while it should be only one .

dofek · June 3, 2024, 6:20pm

Hi manver,
NVTX ranges that wrap CUDA kernel launches are projected from the CPU onto the GPU, creating GPU-side annotations.
That is why the Events view displays multiple ranges with the same name instead of the single range you expected.

manver · June 4, 2024, 10:45am

so whats the solution

Topic		Replies	Views
How to control profiling start time using Nsight System gui like --capture-range=cudaProfilerApi in cli Profiling Linux Targets nsight	12	4606	April 4, 2023
NVTX display problem Profiling Linux Targets	6	1599	December 1, 2023
Nsight Systems does not collect CUDA events Profiling Linux Targets	21	10015	January 11, 2023
Profile command cannot be used more than once with the same agent Profiling Linux Targets	6	1606	July 23, 2020
Nsight nsys not collecting any CUDA kernel data (2023.1.2.43-32377213v0) Profiling Linux Targets	19	2987	September 14, 2023
Profiler stuck while profiling a range Nsight Compute	1	2305	November 20, 2023
Why does not the Nsight systems work well? Profiling Linux Targets	2	758	December 27, 2023
Nsys Does not Show the kernels output Profiling Embedded Targets	21	3622	October 20, 2022
Nsight compute failed to profile with nvtx ranges in pytorch Nsight Compute pytorch , profiling	4	1580	September 19, 2023
NVTX doesn't appear in timeline Profiling Linux Targets	1	787	December 19, 2019

Nsight system showing wrong behaviour

Related topics