Fix for GTX480 DP performance

jimkress · August 12, 2010, 12:28pm

Has anyone discovered how to “repair” the crippling Nvidia did to the GTX480 DP performance?

eyalhir74 · August 12, 2010, 12:44pm

Yes… buy a C2050 ;)

jack · August 17, 2010, 1:28pm

No, it’s probably disabled in hardware or in the device’s firmware. In any case, I don’t think Nvidia would be OK with people publicly posting a “fix” for it here on their own forum.

jimkress · August 17, 2010, 3:02pm

:rolleyes: Probably true …

OH well …

ONeill · August 18, 2010, 9:43am

Adding a question mark to the topic’s title would be nice…
Imagine a kid on xmas-eve when you take his new toy away from him! Imagine those eyes and big tears in them :)

Croow · August 18, 2010, 10:54pm

What is “DP”, double precision?
How severe is the 480’s degradation?

Thanks

seibert · August 19, 2010, 3:22am

Yes. The Tesla C2050 computes in double precision at 1/2 the rate of single precision, whereas the GTX 465/470/480 series of cards computes in double precision at 1/8 the rate of single precision (like both the Tesla and GeForce cards using the previous-generation GT200 chip).
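
To put rough numbers on those ratios, here is a minimal Python sketch of the usual peak-throughput estimate (cores × shader clock × 2 FLOPs per fused multiply-add × DP:SP ratio). The core counts and shader clocks are assumptions taken from the published specifications, not from this thread, and `peak_gflops` is just an illustrative helper name.

```python
def peak_gflops(cores, shader_clock_ghz, dp_to_sp_ratio=1.0):
    """Theoretical peak in GFlop/s.

    Assumes each core retires one fused multiply-add (2 FLOPs) per
    shader clock in single precision, scaled by the DP:SP ratio.
    """
    return cores * shader_clock_ghz * 2 * dp_to_sp_ratio


# Assumed specs (not stated in this thread): core count, shader clock in GHz.
# The DP:SP ratios are the ones quoted in the post above.
CARDS = {
    "GTX 480": {"cores": 480, "clock": 1.401, "dp_ratio": 1 / 8},
    "C2050": {"cores": 448, "clock": 1.15, "dp_ratio": 1 / 2},
}

if __name__ == "__main__":
    for name, spec in CARDS.items():
        sp = peak_gflops(spec["cores"], spec["clock"])
        dp = peak_gflops(spec["cores"], spec["clock"], spec["dp_ratio"])
        print(f"{name}: SP peak {sp:.1f} GFlop/s, DP peak {dp:.1f} GFlop/s")
```

Under these assumptions the GTX 480 peaks at roughly 168 GFlop/s in double precision and the C2050 at roughly 515 GFlop/s, so the theoretical gap between the two is about a factor of three.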

Boxed_Cylon · August 19, 2010, 4:03pm

Hummm… According to the latest dgemm posted on the MAGMA site, the C2050 reaches 300 GFlop/s (58% of theoretical peak). My GTX480 on that same benchmark reaches 165 GFlop/s. That is a little better than half the speed of the C2050, and quite a bit better than the numbers above suggest: in double precision the GTX480 runs at about 1/2 the rate of the C2050, rather than 1/4.

In single precision the C2050 reaches 639 GFlop/s, whereas the GTX480 reaches 835 GFlop/s on the MAGMA sgemm benchmark. Double precision on the C2050 is 1/2 the rate of single precision as you say, but on the GTX480 it is 1/5 (165/835) the rate of single precision. But the GTX480 seems to run a little faster than the C2050 overall, so in the end its double precision rate is about half that of the C2050.
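
For anyone who wants to check the arithmetic, a small Python sketch using only the four benchmark figures quoted in this post; the helper names `dp_to_sp` and `relative` are made up for illustration.

```python
# Measured MAGMA gemm throughput quoted in this post, in GFlop/s.
MEASURED = {
    "C2050": {"sgemm": 639.0, "dgemm": 300.0},
    "GTX480": {"sgemm": 835.0, "dgemm": 165.0},
}


def dp_to_sp(card):
    """Measured double precision rate as a fraction of single precision."""
    return MEASURED[card]["dgemm"] / MEASURED[card]["sgemm"]


def relative(card, baseline, kernel):
    """Throughput of `card` relative to `baseline` on the same kernel."""
    return MEASURED[card][kernel] / MEASURED[baseline][kernel]


if __name__ == "__main__":
    for card in MEASURED:
        print(f"{card}: DP/SP = {dp_to_sp(card):.2f}")
    print(f"GTX480 vs C2050, dgemm: {relative('GTX480', 'C2050', 'dgemm'):.2f}")
    print(f"GTX480 vs C2050, sgemm: {relative('GTX480', 'C2050', 'sgemm'):.2f}")
```

The measured DP/SP ratio comes out near 0.2 for the GTX480 and a bit under 0.5 for the C2050, with the GTX480 at about 0.55 of the C2050 on dgemm.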

Boxed_Cylon · August 19, 2010, 7:11pm

empty post.

moozoo · August 20, 2010, 4:58am

What is happening is that the C2050 and GTX 480 are being held back by memory bandwidth etc., and not by double precision flops.

i.e. the GTX 480’s 165 Gflop/s is probably 95% of its peak double precision Gflop/s.

On other problems the C2050 would probably get a lot more than 300 Gflop/s.

Double precision implies twice the data must be moved about, so more problems will be memory bandwidth limited than with single precision.
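
As a rough illustration of that last point only, here is a simple roofline-style estimate in Python for a streaming kernel of the form y = a*x + y. The 177.4 GB/s bandwidth and the peak rates are assumed GTX 480 figures from published specs, not measurements from this thread, and the function names are hypothetical.

```python
def arithmetic_intensity(flops_per_element, accesses_per_element, bytes_per_value):
    """FLOPs performed per byte moved to or from device memory."""
    return flops_per_element / (accesses_per_element * bytes_per_value)


def attainable_gflops(peak_gflops, bandwidth_gb_s, intensity):
    """Roofline estimate: limited by compute peak or by bandwidth * intensity."""
    return min(peak_gflops, bandwidth_gb_s * intensity)


# Assumed GTX 480 figures (not from this thread).
BANDWIDTH_GB_S = 177.4
PEAK_SP = 1344.96  # GFlop/s
PEAK_DP = 168.12   # GFlop/s, with the 1/8 DP:SP ratio

# y = a*x + y: one multiply and one add per element (2 FLOPs),
# two loads and one store per element (3 memory accesses).
SP_INTENSITY = arithmetic_intensity(2, 3, 4)  # 4-byte floats
DP_INTENSITY = arithmetic_intensity(2, 3, 8)  # 8-byte doubles

if __name__ == "__main__":
    sp = attainable_gflops(PEAK_SP, BANDWIDTH_GB_S, SP_INTENSITY)
    dp = attainable_gflops(PEAK_DP, BANDWIDTH_GB_S, DP_INTENSITY)
    print(f"SP: {sp:.1f} GFlop/s attainable, DP: {dp:.1f} GFlop/s attainable")
```

In this model both precisions sit far below their compute peaks, and double precision gets exactly half the single precision rate purely because each value is twice as large. A kernel with much higher arithmetic intensity would hit the compute peak instead.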
