I just ran the SDK examples on a C2050 to see how it compares to the C1060, and at first I didn't see much difference.
I noticed the device-to-device memory bandwidth was comparable: 73 GB/s for the C1060 and 78 GB/s for the C2050. Then I saw the monitor was attached to the Tesla, which I thought might be loading the GPU significantly. I disabled that monitor in Windows and made a Quadro 290 the main display, but I can't say that improved performance: memory bandwidth went to 79 GB/s (possibly just noise, since monitor refresh consumes very little), and compute performance for things like convolutionFFT2D didn't improve.
Then I saw you can disable memory error correction (ECC), and with it off the bandwidth went up to 90 GB/s, along with compute performance for most applications.
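For what it's worth, here's a quick back-of-the-envelope estimate of the ECC bandwidth penalty implied by the numbers above (pure arithmetic from the figures I reported, no GPU needed; the variable names are just mine):

```python
# ECC overhead estimate from the bandwidthTest figures reported above.
bw_ecc_on = 78.0   # GB/s, C2050 device-to-device, ECC enabled
bw_ecc_off = 90.0  # GB/s, C2050 device-to-device, ECC disabled

# Fraction of peak device-to-device bandwidth lost to ECC
overhead = 1.0 - bw_ecc_on / bw_ecc_off
print(f"ECC bandwidth penalty: {overhead:.0%}")  # -> ECC bandwidth penalty: 13%
```

So on my card, ECC seems to cost roughly 13% of device-to-device bandwidth, which is in the same ballpark as the compute improvement I saw after turning it off.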
I still want to know: how significant is the impact of driving a display from the Tesla C2050 on CUDA performance?