oh… what is wrong in my tx1 board…
when I run 6_Advanced/concurrentKernels this show result like this…
GPU Device 0: “NVIDIA Tegra X1” with compute capability 5.3
Detected Compute SM 5.3 hardware with 2 multi-processors
Expected time for serial execution of 8 kernels = 0.080s
Expected time for concurrent execution of 8 kernels = 0.010s
Measured time for sample = 0.000s
Through the profiler, I understand this result means nothing is excuted in GPU.
because, I can see the no stream is run in the picture.
so, I’m curious that whether TX1 support concurrent CUDA kernel excution or It doesn’t support at all/
I’ll really wait for answer.