Tesla C1060 vs GTX 480 Double precision performance

geek1999 · September 27, 2010, 9:52am

Is it worthwhile to switch from Tesla C1060 cards to GTX 480 in terms of double precision performance?

For single precision, the GTX 480 runs over 50% faster than the Tesla C1060 in my experience, so I think switching is worthwhile for single precision calculations. But I don’t know about double precision. I’m afraid the results may turn out unacceptable even if the calculation itself runs faster.

Thanks

Jimmy_Pettersson · September 27, 2010, 10:05am

I think that if your application is bandwidth bound, it doesn’t make a huge difference between a GTX 480 and a Quadro/Tesla card, since the bandwidth and not the compute units will be the bottleneck. This is the case in many applications. Furthermore, the GTX 480 has higher bandwidth since it’s clocked higher…
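To make the bandwidth-bound argument concrete, here is a rough roofline-style estimate in Python. It is only a sketch: the bandwidth and peak figures are assumed spec-sheet values (not measurements from this thread), and the DAXPY-like kernel with its byte count is a hypothetical example.

```python
# Roofline-style bound: attainable GFLOP/s = min(peak compute, intensity * bandwidth).
# All hardware numbers below are ASSUMED spec-sheet values, not measurements.

def attainable_gflops(peak_gflops, bandwidth_gb_s, flops_per_byte):
    """Upper bound on throughput for a kernel with the given arithmetic intensity."""
    return min(peak_gflops, bandwidth_gb_s * flops_per_byte)

# Hypothetical double precision DAXPY-like kernel, y[i] = a * x[i] + y[i]:
# 2 flops per element, 24 bytes moved per element (read x, read y, write y).
DAXPY_INTENSITY = 2 / 24

# (assumed peak DP GFLOP/s, assumed memory bandwidth in GB/s)
CARDS = {
    "Tesla C1060": (78.0, 102.0),
    "GTX 480": (168.0, 177.4),
}

def report():
    """Return the estimated bound per card for the DAXPY-like kernel."""
    return {
        name: attainable_gflops(peak, bw, DAXPY_INTENSITY)
        for name, (peak, bw) in CARDS.items()
    }

if __name__ == "__main__":
    for name, bound in report().items():
        print(f"{name}: at most {bound:.1f} GFLOP/s for this kernel")
```

Under these assumptions both cards sit far below their double precision peak for such a kernel, so the ratio between them comes from memory bandwidth alone and the reduced DP rate does not matter.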

anthonyfmorse · September 27, 2010, 11:27am

I might have this wrong, but I thought NVIDIA had disabled double precision on all the Fermi GTX cards to encourage you to buy the C2060 Tesla.

YDD · September 27, 2010, 1:50pm

No, they just disabled most of the double precision units.

seibert · September 27, 2010, 2:25pm

Not disabled, just crippled. The full Fermi does DP at 1/2 the rate of SP, and the GeForce Fermi cards do DP at 1/8 the rate of SP, which is just like the GT200 Tesla cards. So, the overall increase in # of CUDA cores in the GeForce Fermi chips gives you a net improvement in double precision over the last generation Tesla, even with the performance cap.
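The arithmetic behind that comparison can be sketched in a few lines of Python. The core counts and shader clocks are assumed spec-sheet values, and single precision peak is counted as one multiply-add (2 flops) per core per clock, which ignores the extra MUL sometimes included in quoted GT200 figures.

```python
# Peak-throughput arithmetic for the 1/8-rate comparison.
# Core counts and shader clocks are ASSUMED spec-sheet values.

def peak_sp_gflops(cuda_cores, shader_clock_ghz):
    """Single precision peak, counting one multiply-add (2 flops) per core per clock."""
    return cuda_cores * shader_clock_ghz * 2

def peak_dp_gflops(cuda_cores, shader_clock_ghz, dp_ratio):
    """Double precision peak as a fixed fraction of the single precision peak."""
    return peak_sp_gflops(cuda_cores, shader_clock_ghz) * dp_ratio

# (cores, shader clock in GHz, DP:SP ratio)
C1060 = (240, 1.296, 1 / 8)
GTX480 = (480, 1.401, 1 / 8)

if __name__ == "__main__":
    c1060_dp = peak_dp_gflops(*C1060)
    gtx480_dp = peak_dp_gflops(*GTX480)
    print(f"Tesla C1060 DP peak: {c1060_dp:.1f} GFLOP/s")
    print(f"GTX 480 DP peak:     {gtx480_dp:.1f} GFLOP/s")
    print(f"Ratio:               {gtx480_dp / c1060_dp:.2f}x")
```

With these assumed numbers the result is roughly 78 versus 168 GFLOP/s, a bit over 2x in favor of the GTX 480 at peak. Real kernels will see less, depending on how bandwidth bound they are.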

DanaJ · September 28, 2010, 2:25pm

Ideally you’d go to a Tesla 2050 or 2070, where you’d get full DP performance as well as the Tesla vs. consumer intangibles. I’ll assume that’s not an option since you didn’t ask about it, so let’s ignore all those discussions.

As Jimmy said, if you’re completely bandwidth bound and have well-optimized kernels, then there is probably little benefit. However, you’re getting a decent improvement on your SP code, so this may not be true.

On my CFD-type code, we’re definitely getting a speedup in double precision when comparing an S1070 to a standard-clocked GTX 470: about 1.5-1.6x for most kernels. A GTX 480 should be faster in both memory and GPU clock.
