About jetson-nano device query

parody9c · July 8, 2019, 7:58am

I executed device query on my jetson-nano. Here is my result :

CUDA Device Query (Runtime API) version (CUDART static linking)

Detected 1 CUDA Capable device(s)

Device 0: “NVIDIA Tegra X1”
CUDA Driver Version / Runtime Version 10.0 / 10.0
CUDA Capability Major/Minor version number: 5.3
Total amount of global memory: 3963 MBytes (4155834368 bytes)
( 1) Multiprocessors, (128) CUDA Cores/MP: 128 CUDA Cores
GPU Max Clock rate: 922 MHz (0.92 GHz)
Memory Clock rate: 13 Mhz
Memory Bus Width: 64-bit
L2 Cache Size: 262144 bytes
Maximum Texture Dimension Size (x,y,z) 1D=(65536), 2D=(65536, 65536), 3D=(4096, 4096, 4096)
Maximum Layered 1D Texture Size, (num) layers 1D=(16384), 2048 layers
Maximum Layered 2D Texture Size, (num) layers 2D=(16384, 16384), 2048 layers
Total amount of constant memory: 65536 bytes
Total amount of shared memory per block: 49152 bytes
Total number of registers available per block: 32768
Warp size: 32
Maximum number of threads per multiprocessor: 2048
Maximum number of threads per block: 1024
Max dimension size of a thread block (x,y,z): (1024, 1024, 64)
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535)
Maximum memory pitch: 2147483647 bytes
Texture alignment: 512 bytes
Concurrent copy and kernel execution: Yes with 1 copy engine(s)
Run time limit on kernels: Yes
Integrated GPU sharing Host Memory: Yes
Support host page-locked memory mapping: Yes
Alignment requirement for Surfaces: Yes
Device has ECC support: Disabled
Device supports Unified Addressing (UVA): Yes
Device supports Compute Preemption: No
Supports Cooperative Kernel Launch: No
Supports MultiDevice Co-op Kernel Launch: No
Device PCI Domain ID / Bus ID / location ID: 0 / 0 / 0
Compute Mode:
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) >

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 10.0, CUDA Runtime Version = 10.0, NumDevs = 1
Result = PASS

but I find the differences that each device has another amount of global memory.
another result : Playing with CUDA on My NVIDIA Jetson Nano | Stephen Smith's Blog

It doesn’t make a big difference, but I want to know why is the reason.

dusty_nv · July 8, 2019, 2:34pm

Hi parody9c, since the CPU/GPU share system memory on Jetson, the slight different in the available CUDA global memory is probably related to memory being used by the kernel, kernel drivers, framebuffer size, ect.

parody9c · July 9, 2019, 8:11pm

Thank you! dusty_nv!

Topic		Replies	Views
Incorrect CUDA deviceQuery Results? Jetson Nano	6	2719	October 14, 2021
How many CUDA multiprocessors does the Jetson Nano have? Jetson Nano cuda	3	597	September 22, 2023
Why does my Jetson Orin Nano Only Has About 6.5GB Memory? Jetson Orin Nano	4	316	April 24, 2024
Jetson TX2 Cache Line Size Jetson TX2	10	2475	October 18, 2021
deviceQuery app fails with error 38 without root premissions on Jetson TX2 Jetson TX2 cuda	2	444	October 18, 2021
deviceQuery shows Jetson-tx1 GPU Max Clock rate: 72 MHz,Memory Clock rate: 13 Mhz Jetson TX1	2	1637	October 18, 2021
Jetson Nano Failed in SimpleCUFFT sample GPU-Accelerated Libraries cufft	5	832	November 15, 2021
Who can help to provide information from running CUDA's devicequery on AGX Xavier? Jetson AGX Xavier	5	999	October 18, 2021
Devicequery in jetson nana works for docker but not for kubernetes Jetson Nano docker	6	1623	October 15, 2021
Programmatically limiting GPU access Jetson Orin NX gpu-computing	4	60	July 29, 2024

About jetson-nano device query

Related topics