CUDA on Windows much slower than on Linux

jony · January 24, 2013, 3:28pm

Hi there, I have a CUDA application that I have to run on both Windows and Linux. However, the performance I get for the same test case on the same card (Tesla C1060) is very different: on Windows (Vista 64-bit) the run time is nearly double that on Linux (openSUSE 11.1 64-bit). Do you have any idea why? Many thanks,
jony

njuffa · January 24, 2013, 6:49pm

Knowing nothing about the properties of your application and very little about your setup, my guess is that you are running with the default WDDM driver on Windows, and this driver model has quite a bit of overhead. The CUDA driver tries to minimize the impact of this through batching of work, but the overhead can still be significant.
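One rough way to see whether launch overhead is a factor is to time a large number of empty kernel launches on each OS. The following is only a sketch: the kernel name `emptyKernel` and the launch count are arbitrary choices for illustration, and the result reflects back-to-back launch throughput (including any batching the driver does), not the cost of a single isolated launch.

```cuda
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Empty kernel: the measured time is launch/driver overhead, not GPU work.
__global__ void emptyKernel() {}

int main() {
    const int launches = 10000;  // arbitrary count, chosen for illustration

    // Warm-up launch so context creation is excluded from the timing.
    emptyKernel<<<1, 1>>>();
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        std::fprintf(stderr, "warm-up failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < launches; ++i) {
        emptyKernel<<<1, 1>>>();
    }
    // Wait for all queued launches to finish before stopping the clock.
    err = cudaDeviceSynchronize();
    auto stop = std::chrono::steady_clock::now();
    if (err != cudaSuccess) {
        std::fprintf(stderr, "launch loop failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    double totalUs =
        std::chrono::duration<double, std::micro>(stop - start).count();
    std::printf("average time per empty launch: %.2f us\n", totalUs / launches);
    return 0;
}
```

If the per-launch figure is much higher on Windows than on Linux, an application that issues many short kernels would be expected to suffer the most from the difference.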

On Windows, you would want to use the TCC driver, which uses a different driver model and avoids most of the overhead inherent in the WDDM model.
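To check which driver model a given GPU is currently using, the CUDA runtime reports it in the `tccDriver` field of `cudaDeviceProp`. A minimal sketch that lists every device the runtime can see (on Linux the field is simply 0):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    cudaError_t err = cudaGetDeviceCount(&count);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaGetDeviceCount: %s\n", cudaGetErrorString(err));
        return 1;
    }

    for (int dev = 0; dev < count; ++dev) {
        cudaDeviceProp prop;
        err = cudaGetDeviceProperties(&prop, dev);
        if (err != cudaSuccess) {
            std::fprintf(stderr, "device %d: %s\n", dev, cudaGetErrorString(err));
            continue;
        }
        // tccDriver is 1 when the device runs under the TCC driver, 0 otherwise.
        std::printf("device %d: %s (compute %d.%d), tccDriver=%d\n",
                    dev, prop.name, prop.major, prop.minor, prop.tccDriver);
    }
    return 0;
}
```

The same listing also shows which device index corresponds to which card, which helps confirm that the application is actually running on the intended GPU.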

jony · January 25, 2013, 6:37pm

Thanks njuffa, I have realized that I was using a different version of the CUDA compiler on Windows (5.0) than the one used on Linux (4.2). After installing 4.2 on Windows, it is now “only” a factor of 1.3 slower.
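A mismatch like this is easy to spot by printing the toolkit and driver versions from inside the application on both machines. A small sketch, assuming only the standard runtime calls `cudaRuntimeGetVersion` and `cudaDriverGetVersion` (versions are encoded as 1000 * major + 10 * minor):

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Print an encoded CUDA version (1000 * major + 10 * minor) as major.minor.
static void printVersion(const char *label, int v) {
    std::printf("%s: %d.%d\n", label, v / 1000, (v % 1000) / 10);
}

int main() {
    int runtimeVersion = 0;
    int driverVersion = 0;

    if (cudaRuntimeGetVersion(&runtimeVersion) != cudaSuccess ||
        cudaDriverGetVersion(&driverVersion) != cudaSuccess) {
        std::fprintf(stderr, "could not query CUDA versions\n");
        return 1;
    }

    // CUDART_VERSION is fixed at compile time by the toolkit headers in use.
    printVersion("compiled against runtime", CUDART_VERSION);
    printVersion("runtime library in use", runtimeVersion);
    printVersion("highest CUDA version supported by driver", driverVersion);
    return 0;
}
```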

The drivers are different too, but I can’t use the same one, as the installer doesn’t recognize my 2 GPUs (I use a Quadro for visualisation and a Tesla for computing).

How do I switch from the WDDM driver to the TCC driver?

njuffa · January 25, 2013, 7:17pm

Sorry, I have no first-hand knowledge of using the TCC driver. As best I can tell, switching to TCC mode is accomplished via the -dm option of nvidia-smi.

CudaaduC · January 25, 2013, 10:06pm

Strange, I have had the opposite experience with the GTX 680. Are you using Visual Studio? You do need to configure the compiler settings correctly and select x64 if you are using a 64-bit operating system.
Without any optimizations I can get 934 GFLOPS out of the 680 using CUBLAS matrix multiplication, while in Ubuntu I was only getting ~500 GFLOPS.

I would also recommend NVIDIA Nsight for use with Visual Studio. It is free and quite useful for debugging.

Lev · January 26, 2013, 7:14pm

You probably have a lot of other differences too, such as compiler options. Maybe you are using the less powerful GPU on Windows, if the driver does not recognize the second one.
