cutil cuda sdk utility

middlesummer · December 22, 2007, 7:53pm

I try cudaGetDeviceCount recently. From demo file multigpu.cu the comments say
// Note that in order to detect multiple GPUs in your system you have to disable
// SLI in the nvidia control panel. Otherwise only one GPU is visible to the
// application. On the other side, you can still extend your desktop to screens
// attached to both GPUs.
I have a window XP with NVidia 8500 and Telsa C870. If disable SLI, I get report of 2 GPUs and processiog time is slow (2000ms). If do not disable SLI, I det report of 1 GPU. The processing time is fast (30ms).
If I have C870, should I get report of 128 GPUs?
Why 2 GPUs process slow than 1 GPU?

Thanks for help!

AndreiB · December 22, 2007, 9:14pm

I guess that with SLI disabled CUDA uses 8500 which is ‘first’. It’s a slow card, so 200 ms is okay for it.

When you turn SLI on CUDA no longer detects 8500 as valid device and swithces to Tesla, which is fast.

No, you shouldn’t get 128 GPUs, you should get as much as installed in your system. And your system has 2. 128 is number of stream processors on board of Tesla, this kind of information is available with other SDK functions.

And it’s not 2 GPUs are slower, it is 8500 which is slower than Tesla. Those SDK samples do not demonstrate simultaneous calculations on several CUDA devices.

middlesummer · December 23, 2007, 3:20pm

Thanks a lot for help! This is really a CUDA device’s number not the stream processors’ number. For the application, we should always enable SLI and use Tesla as many as possible. External Media

DenisR · December 23, 2007, 3:30pm

No, you do not need to use SLI. You can query the CUDA-capable devices in your computer and use only the fastest one.

middlesummer · January 1, 2008, 8:51pm

I run a test with cublasStrsv and compare the result with strsv.c runs on CPU. To my surprise, with same data set, for a matrix upto 1000X1000, CPU is fast than GPU. Only about first 10 data of the solution of GPU are right, all others are divergent. But the solutions of CPU are all right. What’s wrong with cublasStrsv?

DenisR · January 2, 2008, 7:13am

1000x1000 is maybe not big enough for GPU overhead to be cancelled out. And GPU does float’s, while CPU does extended precision floating point calculation, so results may vary a bit. (Or a lot if your algorithm is susceptible to small variances)

Topic		Replies	Views
CUDA - Number of Devices Available Fewer devices available when SLI is disabled. Slower with SLI ena CUDA Programming and Performance	3	4091	October 21, 2009
2 Graphic cards, cudaGetDeviceCount() find only 1 CUDA Programming and Performance	4	1341	September 16, 2009
3 GTX 260 cards cudaGetDeviceCount CUDA Programming and Performance	0	1885	June 24, 2009
SLI off and cudaGetDeviceCount = 1 (instead 2?) CUDA Programming and Performance	0	3682	January 20, 2010
Dual GTX 295 detects only one CUDA device CUDA Programming and Performance	0	4180	July 30, 2009
double C870 tesla in SLI SLI mode doesn't work... CUDA Programming and Performance	5	3923	July 5, 2008
Dual GPU Laptop CUDA Programming and Performance	8	16525	February 5, 2008
Programs don't use my 2 Geforce SLI CUDA Programming and Performance	2	4706	March 5, 2008
Question: CUDA with multiple devices in SLI mode CUDA Programming and Performance	3	5944	March 13, 2009
CUDA + SLI on WinXP CUDA Programming and Performance	9	7167	February 25, 2009

cutil cuda sdk utility

Related topics