Hello.
I am interested in the memory addressing scheme used in TX2.
One question was raised while looking at the CUDA Module information through the device query.
It is the size of Total Global Memory.
TX2 has twice the size of memory compared to TX1. Why does it show this output?
I have made TX2 Flash with JetPack 3.0 provided on the homepage.
I would like to support up to 8GB of memory on TX2 because I am requesting a high amount of memory from the ongoing project. Anyone have any ideas?
TX1
root@tegra-ubuntu:~/test# ./query
CUDA Device Query...
There are 1 CUDA devices.
CUDA Device #0
Major revision number: 5
Minor revision number: 3
Name: NVIDIA Tegra X1
Total global memory: 4188778496
Total shared memory per block: 49152
Total registers per block: 32768
Warp size: 32
Maximum memory pitch: 2147483647
Maximum threads per block: 1024
Maximum dimension 0 of block: 1024
Maximum dimension 1 of block: 1024
Maximum dimension 2 of block: 64
Maximum dimension 0 of grid: 2147483647
Maximum dimension 1 of grid: 65535
Maximum dimension 2 of grid: 65535
Clock rate: 998400
Total constant memory: 65536
Texture alignment: 512
Concurrent copy and execution: Yes
Number of multiprocessors: 2
Kernel execution timeout: Yes
Press any key to exit...
TX2
root@tegra-ubuntu:~/test# ./query
CUDA Device Query...
There are 1 CUDA devices.
CUDA Device #0
Major revision number: 6
Minor revision number: 2
Name: GP10B
Total global memory: 3940433920
Total shared memory per block: 49152
Total registers per block: 32768
Warp size: 32
Maximum memory pitch: 2147483647
Maximum threads per block: 1024
Maximum dimension 0 of block: 1024
Maximum dimension 1 of block: 1024
Maximum dimension 2 of block: 64
Maximum dimension 0 of grid: 2147483647
Maximum dimension 1 of grid: 65535
Maximum dimension 2 of grid: 65535
Clock rate: 1300500
Total constant memory: 65536
Texture alignment: 512
Concurrent copy and execution: Yes
Number of multiprocessors: 2
Kernel execution timeout: No
Press any key to exit...
PS, Kernel execution timeout in TX2 is intentional.