We have encountered a strange problem, our code runs slower on tx2 compared with tx1. As we expected, the performance should be at least as good as on tx1, however it is usually two to three times slower for some programs. The setting for the code is exactly the same with that of tx1, except the -gencode=arch (tx2:62 tx1:53). Before we run on tx2, we also set tx2 environment using:
nvpmodel -m 0 ./jetson_clocks.sh
We have packed the code and upload at https://github.com/WANG-KX/tx2_issue, you can downloa
d and run it. Details are also provided in the README file.
Is there any other environment settings? Why the code is so slow on tx2?
Thanks for your help!