CUDA performance

Hi everyone,

I ran some benchmarks comparing the NX with my laptop GPU, a GTX930MX.
The CUDA specifications are almost identical: both have the same number of cores, but the NX has 6 SMs instead of the 3 on the GTX930MX.

When I benchmark the CUDA vectorAdd sample, I don’t understand why the execution time on the NX is more than 3 times longer.

Can someone help me understand this difference?
Thanks in advance.

Hi,

Have you maximized the device performance first?

$ sudo nvpmodel -m 0
$ sudo jetson_clocks
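
Before re-running the benchmark, it may help to confirm that the settings actually took effect. The sketch below assumes the standard Jetson tools are on the PATH; `show_jetson_perf_state` is a hypothetical helper name used only for illustration. It queries the active power mode with `nvpmodel -q` and prints the current clock settings with `jetson_clocks --show`.

```shell
#!/bin/sh
# Sketch: report the active Jetson power mode and current clock settings.
# show_jetson_perf_state is a hypothetical helper, not part of any NVIDIA tool.
show_jetson_perf_state() {
    # Bail out early if the Jetson utilities are not installed.
    for tool in nvpmodel jetson_clocks; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "$tool not found: this does not look like a Jetson system"
            return 1
        fi
    done
    # Query the power mode currently selected by nvpmodel.
    sudo nvpmodel -q
    # Show the current CPU/GPU/EMC clock settings.
    sudo jetson_clocks --show
}
```

Calling `show_jetson_perf_state` once before and once after the two commands above should make the change in mode and clocks visible. Which power budget a given mode number maps to depends on the module and software release, so check the query output rather than assuming.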

Thanks.

This dramatically improves the performance of the NX.
The execution time is now of the same order as on my laptop.

Thanks a lot for your quick support.

Good to know this : )