Quadro NVS 140M supported?

ylai · July 17, 2007, 8:34pm

When I tried to run a few of the examples from the SDK on a Quadro NVS 140M (which according to Wikipedia is supposedly based on 8400), some of them suceed and some of them failed. Adding a few lines to check the results shows e.g. for convolutionSeparable, that the GPU result is 0 for the entire array.

So:

Is Quadro NVS 140M supported?
Why does some of the programs succeed (convolutionFFT2D, dwtHaar1D, fluidsGL, imageDenoising, matrixMul, MersenneTwister, …), but other fail (convolutionSeparable, convolutionTexture, histogram64, …)?

The setup is a Lenovo ThinkPad T61 running Red Hat Enterprise Linux 5 Desktop in 64-bit, driver version string is “NVIDIA GLX Module 100.14.11 Wed Jun 13 17:16:40 PDT 2007”.

netllama · July 17, 2007, 8:35pm

How are the examples failing?
What is the full output?
Do they fail for both gpu & emu builds?

e.ping · July 17, 2007, 9:04pm

It looks like this has 128MB of video memory, so some examples are probably too big. Can you please try the deviceQuery sample? This will display the gpu details (speed & memory)

ylai · July 17, 2007, 9:09pm

The emu build runs fine. The failure looks like this:

$ ./release/alignedTypes 

Allocating memory...

Generating host input data array...

Uploading input data to GPU memory...

Testing misaligned types...

RGBA8_misaligned...

Time: 3.095000 ms / Copy throughput: 60.182395 GB/s.

TEST FAILED

LA32_misaligned...

Time: 0.024000 ms / Copy throughput: 7761.021388 GB/s.

TEST FAILED

RGB32_misaligned...

Time: 0.015000 ms / Copy throughput: 12417.634109 GB/s.

TEST FAILED

RGBA32_misaligned...

Time: 0.015000 ms / Copy throughput: 12417.634606 GB/s.

TEST FAILED

Testing aligned types...

RGBA8...

Time: 0.014000 ms / Copy throughput: 13304.607798 GB/s.

TEST FAILED

I32...

Time: 0.014000 ms / Copy throughput: 13304.607798 GB/s.

TEST FAILED

LA32...

Time: 0.016000 ms / Copy throughput: 11641.531630 GB/s.

TEST FAILED

RGB32...

Time: 0.014000 ms / Copy throughput: 13304.607798 GB/s.

TEST FAILED

RGBA32...

Time: 0.021000 ms / Copy throughput: 8869.738925 GB/s.

TEST FAILED

RGBA32_2...

Time: 0.014000 ms / Copy throughput: 13304.607798 GB/s.

TEST FAILED

Shutting down...

Press ENTER to exit...

$ ./release/convolutionSeparable 

4096 x 4096

Initializing data...

Warm up...

GPU convolution...

GPU convolution time : 0.033000 msec //508400.487603 Mpixels/sec

Reading back GPU results...

Checking the results...

...running convolutionRowCPU()

...running convolutionColumnCPU()

...comparing the results

L1 norm: 1.000000E+00

TEST FAILED

Shutting down...

Press ENTER to exit...

Segmentation fault

$ ./release/convolutionTexture   

Initializing data...

convolutionRowGPU()

...convolutionRowGPU() time: 10.230000 msecs; //1640.001637 Mpix/s

Copying convolutionRowGPU() output back to a_Data...

...cudaMemcpyToArray() time: 0.019000 msecs; //883011.396814 Mpix/s

convolutionColumnGPU()...

...convolutionColumnGPU() time: 0.028000 msecs; //599186.267219 Mpix/s

Reading back GPU results...

Checking GPU results...

...convolutionRowCPU()

...convolutionColumnCPU()

...comparing the results

L1 norm: 1.000000E+00

TEST FAILED

Shutting down...

Press ENTER to exit...

Segmentation fault

ylai · July 17, 2007, 9:10pm

I see, this is probably the cause. I will decrease the problem size and try again.

$ ./release/deviceQuery 

There is 1 device supporting CUDA

Device 0: "Quadro NVS 140M"

  Major revision number:                         1

  Minor revision number:                         1

  Total amount of global memory:                 133496832 bytes

  Total amount of constant memory:               65536 bytes

  Total amount of shared memory per block:       16384 bytes

  Total number of registers available per block: 8192

  Warp size:                                     32

  Maximum number of threads per block:           512

  Maximum sizes of each dimension of a block:    512 x 512 x 64

  Maximum sizes of each dimension of a grid:     65535 x 65535 x 1

  Maximum memory pitch:                          262144 bytes

  Texture alignment:                             256 bytes

  Clock rate:                                    337500 kilohertz

Test PASSED

Press ENTER to exit...

e.ping · July 17, 2007, 9:28pm

Thanks. Yep, you’ve got 128MB.

Try reducing the size of DATA_W and DATA_H for the gpu from 4096 to perhaps 1024 or smaller in convolutionSeparable_kernel.cu

e.ping · July 17, 2007, 9:29pm

sorry, in convolutionSeparable.cu (not _kernel)

ylai · July 17, 2007, 9:32pm

Yes, it works! Thanks a lot!

Topic		Replies	Views
cudaError_enum error with Quadro NVS 140M CUDA Programming and Performance	1	1723	May 21, 2008
Problem with examples most of them crash CUDA Programming and Performance	6	7535	October 22, 2008
Many tests in examples are failing after a sucessful installation. CUDA Programming and Performance	1	1888	November 6, 2008
CUDA 1.1 Error on Quadro NVS 135M CUDA Programming and Performance	9	10882	March 12, 2008
Can't run any example on SDK All exemples stuck at cuda instruction CUDA Programming and Performance	4	4480	April 18, 2008
problem running demos CUDA Programming and Performance	9	8225	January 1, 2009
Run time error on Fedora 9 CUDA Programming and Performance	6	11823	March 20, 2009
Problem with CudaMemCpy in CUDA 1.1 CUDA Programming and Performance	4	6454	May 16, 2008
deviceQuery passes but other demos fail CUDA Programming and Performance	7	2550	January 22, 2009
64bit Ubuntu 8.04 + Quadro NVS290 CUDA Programming and Performance	4	8654	October 8, 2008

Quadro NVS 140M supported?

Related topics