Hello,
we noticed that some CUDA operations on our Xavier were running pretty slowly.
We have a very small test program, which we ran with different nvpmodel settings (we are on JetPack 4.4):
nvpmodel:
30w_all = 10.4ms
maxn = 7.7ms
maxn + clocks = 3.75ms
Could someone please verify, if these runtimes are normal for the Xavier module?
The Xavier module of one of our partners had much faster runtimes:
nvpmodel:
30w_all: 2.3ms
maxn+clocks: 0.9ms
How to run the test:
- unzip
- ./build.sh
- ./profile.sh
To report the runtime, open the newly created “timeline.nvvp” with NVVP.
In the “GPU Details” window (lower left) there should be one operation called “count(int …”.
Please report the avg. duration for that operation and the nvpmodel you used.
Thanks in advance!
Here are the files:
CUDA_test.zip (1.2 KB)