I did a benchmarking project a year and a half ago using the Jetson TK1 and am hoping to perform the same task again on the TX1.
I know that the TX1 is an upgrade in several regards (more CUDA cores, texture units, etc.) but I don’t believe I should have to make any changes to my code.
I have some globals defining max_threads_per_block as well as total_shared_memory_per_block, but these values appear to be the same (1024 and 49152) for the TK1 and TX1.
Am I missing anything, or should porting code between the devices be straightforward assuming no use of peripherals or external devices?