Results of running "deviceQuery" on Amazon EC2 GPU Instance

I work quite a lot with Amazon’s GPU-in-the-cloud offering, and I thought the CUDA community might find it interesting to see the output of running deviceQuery on one of their compute nodes. Note that Amazon’s GPU AMI does not come with the CUDA SDK and examples installed; you have to download and install them yourself.

./deviceQuery Starting...

CUDA Device Query (Runtime API) version (CUDART static linking)

There are 2 devices supporting CUDA

Device 0: "Tesla M2050"

  CUDA Driver Version:                           3.20

  CUDA Runtime Version:                          3.10

  CUDA Capability Major revision number:         2

  CUDA Capability Minor revision number:         0

  Total amount of global memory:                 2817982464 bytes

  Number of multiprocessors:                     14

  Number of cores:                               448

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       49152 bytes

  Total number of registers available per block: 32768

  Warp size:                                     32

  Maximum number of threads per block:           1024

  Maximum sizes of each dimension of a block:    1024 x 1024 x 64

  Maximum sizes of each dimension of a grid:     65535 x 65535 x 1

  Maximum memory pitch:                          2147483647 bytes

  Texture alignment:                             512 bytes

  Clock rate:                                    1.15 GHz

  Concurrent copy and execution:                 Yes

  Run time limit on kernels:                     No

  Integrated:                                    No

  Support host page-locked memory mapping:       Yes

  Compute mode:                                  Default (multiple host threads can use this device simultaneously)

  Concurrent kernel execution:                   Yes

  Device has ECC support enabled:                Yes

Device 1: "Tesla M2050"

  CUDA Driver Version:                           3.20

  CUDA Runtime Version:                          3.10

  CUDA Capability Major revision number:         2

  CUDA Capability Minor revision number:         0

  Total amount of global memory:                 2817982464 bytes

  Number of multiprocessors:                     14

  Number of cores:                               448

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       49152 bytes

  Total number of registers available per block: 32768

  Warp size:                                     32

  Maximum number of threads per block:           1024

  Maximum sizes of each dimension of a block:    1024 x 1024 x 64

  Maximum sizes of each dimension of a grid:     65535 x 65535 x 1

  Maximum memory pitch:                          2147483647 bytes

  Texture alignment:                             512 bytes

  Clock rate:                                    1.15 GHz

  Concurrent copy and execution:                 Yes

  Run time limit on kernels:                     No

  Integrated:                                    No

  Support host page-locked memory mapping:       Yes

  Compute mode:                                  Default (multiple host threads can use this device simultaneously)

  Concurrent kernel execution:                   Yes

  Device has ECC support enabled:                Yes

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.20, CUDA Runtime Version = 3.10, NumDevs = 2, Device = Tesla M2050, Device = Tesla M2050

PASSED
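
For anyone who wants to compare nodes or sanity-check the numbers, here is a minimal Python sketch that parses output in the format above and derives a few figures from it. The helper names (`parse_device_query`, `summarize`) and the trimmed `SAMPLE` string are my own for illustration and are not part of the CUDA SDK. It assumes the "Key: value" layout printed by this version of deviceQuery, which may differ in other releases.

```python
import re

# Trimmed excerpt of the output above (the second device is identical).
SAMPLE = """\
Device 0: "Tesla M2050"
  CUDA Capability Major revision number:         2
  CUDA Capability Minor revision number:         0
  Total amount of global memory:                 2817982464 bytes
  Number of multiprocessors:                     14
  Number of cores:                               448
  Clock rate:                                    1.15 GHz
"""


def parse_device_query(text):
    """Parse deviceQuery text into a list of per-device dicts (hypothetical helper)."""
    devices = []
    for line in text.splitlines():
        header = re.match(r'\s*Device (\d+): "(.+)"', line)
        if header:
            # A 'Device N: "name"' line starts a new device record.
            devices.append({"index": int(header.group(1)), "name": header.group(2)})
            continue
        # Property lines look like "  Key:   value"; split on the first colon only.
        if devices and ":" in line:
            key, value = line.split(":", 1)
            devices[-1][key.strip()] = value.strip()
    return devices


def summarize(device):
    """Derive a few convenience figures from one parsed device record."""
    sms = int(device["Number of multiprocessors"])
    cores = int(device["Number of cores"])
    mem_bytes = int(device["Total amount of global memory"].split()[0])
    major = device["CUDA Capability Major revision number"]
    minor = device["CUDA Capability Minor revision number"]
    return {
        "cores_per_sm": cores // sms,
        "global_mem_gib": round(mem_bytes / 2**30, 2),
        "compute_capability": f"{major}.{minor}",
    }


if __name__ == "__main__":
    for dev in parse_device_query(SAMPLE):
        print(dev["index"], dev["name"], summarize(dev))
```

On the sample this gives 32 cores per multiprocessor (448 / 14), which matches compute capability 2.0, and about 2.62 GiB of global memory. That is less than the card's nominal 3 GB, which is consistent with ECC being enabled, since ECC reserves part of the memory.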