architecture on gpu

hello everybody,

is there any command from linux bash to know the type of architecture of the nvidia graphic device installed in the system?
i mean number of cores, device memory size, shared memory size in every SM, etc…

the other idea was: given the type of graphic device, look for a datasheet…
but i’ve only seen a brief description on nVidia website.

maybe it’s a stupid question, i’m pretty new with cuda programming…

thanks

The CUDA SDK includes a command called deviceQuery that does exactly that and give output like this:

CUDA Device Query (Runtime API) version (CUDART static linking)

There are 2 devices supporting CUDA

Device 0: "GeForce GTX 275"

  CUDA Driver Version:						   3.0

  CUDA Runtime Version:						  3.0

  CUDA Capability Major revision number:		 1

  CUDA Capability Minor revision number:		 3

  Total amount of global memory:				 938803200 bytes

  Number of multiprocessors:					 30

  Number of cores:							   240

  Total amount of constant memory:			   65536 bytes

  Total amount of shared memory per block:	   16384 bytes

  Total number of registers available per block: 16384

  Warp size:									 32

  Maximum number of threads per block:		   512

  Maximum sizes of each dimension of a block:	512 x 512 x 64

  Maximum sizes of each dimension of a grid:	 65535 x 65535 x 1

  Maximum memory pitch:						  2147483647 bytes

  Texture alignment:							 256 bytes

  Clock rate:									1.46 GHz

  Concurrent copy and execution:				 Yes

  Run time limit on kernels:					 Yes

  Integrated:									No

  Support host page-locked memory mapping:	   Yes

  Compute mode:								  Prohibited (no host thread can use this device)

Device 1: "GeForce GTX 275"

  CUDA Driver Version:						   3.0

  CUDA Runtime Version:						  3.0

  CUDA Capability Major revision number:		 1

  CUDA Capability Minor revision number:		 3

  Total amount of global memory:				 939327488 bytes

  Number of multiprocessors:					 30

  Number of cores:							   240

  Total amount of constant memory:			   65536 bytes

  Total amount of shared memory per block:	   16384 bytes

  Total number of registers available per block: 16384

  Warp size:									 32

  Maximum number of threads per block:		   512

  Maximum sizes of each dimension of a block:	512 x 512 x 64

  Maximum sizes of each dimension of a grid:	 65535 x 65535 x 1

  Maximum memory pitch:						  2147483647 bytes

  Texture alignment:							 256 bytes

  Clock rate:									1.46 GHz

  Concurrent copy and execution:				 Yes

  Run time limit on kernels:					 No

  Integrated:									No

  Support host page-locked memory mapping:	   Yes

  Compute mode:								  Default (multiple host threads can use this device simultaneously)

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 4249427, CUDA Runtime Version = 3.0, NumDevs = 2, Device = GeForce GTX 275, Device = GeForce GTX 275

thanks!!
i think i’ve read this somewhere in the programming guide or in the gettin’ started document,
but of course i forgot when i needed… External Image