Looking for a laptop to run scientific simulations in CUDA with double precision - speed is important

ranunculus · November 7, 2017, 7:51pm

I’m planning on using CUDA to write scientific molecular dynamics simulations in which both speed and precision are important - hence the need for doubles. I need this on a laptop since I take my computer to and from work every day. According to the CUDA documentation, any non-GeForce GPU with a compute capability of 3.5 should be able to perform 64-bit floating point operations with one third the speed of 32-bit floating point operations. However, the Wikipedia page for NVIDIA Quadro contradicts this, saying that the Quadro K510M and K610M perform double operations at only 1/24 the speed of single precision, despite having a compute capability of 3.5.

So, my question is, which source is correct? If Wikipedia is correct, what’s the best laptop GPU I can use for fast double calculations in CUDA? Are my best options really the Quadro 5000M or 5010M - which are roughly seven years old at this point - as Wikipedia suggests?

Robert_Crovella · November 7, 2017, 7:58pm

I don’t know where it says that.

ranunculus · November 7, 2017, 8:02pm

Sorry - I meant to say one third rather than one half, post has been edited.

Look at section 5.4.1. Arithmetic Instructions. In the “3.5, 3.7” column, “32-bit floating-point add, multiply, multiply-add” gives 192 Results per Clock Cycle per Multiprocessor, whereas “64-bit floating-point add, multiply, multiply-add” gives 64 results, which is one third. The footnote says that this is only 8 for GeForce GPU’s, but says nothing about Quadro.

njuffa · November 7, 2017, 8:15pm

I am reasonably certain that no mobile GPU (that is, “M” type) that is supported by the currently shipping CUDA 9.0 supports high-throughput double precision. In general, low power and high DP performance do not mix.

Authoritative statements from NVIDIA on this issue are more than welcome.

Robert_Crovella · November 7, 2017, 10:05pm

Yes, if you have a cc3.5 or cc3.7 GPU, that statement is correct (with the additional GeForce vs. Tesla footnote disclaimer). The primary cc 3.5 Geforce exception I am aware of is devices built around GK208 GPU, which goes under the moniker GT640 and others as well.

That is certainly not saying that there is a given ratio for all GeForce and a given ratio for all Tesla. It is nowhere near that simple. But the documentation is correct (AFAIK), if you care to read it carefully and understand it.

I’m not aware of any Quadro GK208 designs, and anyway Kepler (cc3.x = Kepler) is by now a pretty old GPU. I would not recommend buying any Kepler device today. There are better Maxwell, Pascal, (and Volta for non-mobile) choices, regardless of desired features/pricepoint/performance.

There are no non-Tesla cc3.7 GPUs. That particular chip variant exists only in Tesla K80 clothing.

njuffa · November 7, 2017, 10:43pm

In the mobile market segment, realistic choices are Maxwell and Pascal at this time. I would assume it’s going to be almost another year before there will be mobile Volta devices.

ranunculus · November 7, 2017, 10:55pm

Are there any laptop GPUs that are newer than cc3.5 models that can perform double-precision calculations in CUDA 1/2 or 1/3 as fast as single-precision, the way that the Quadro K610M can?

Robert_Crovella · November 8, 2017, 12:23am

Quadro K610M doesn’t perform 1/3 rate DP.

The “full-rate” DP GPUs are:

Tesla P100/Quadro GP100
Volta V100
(no Maxwell)
various Kepler GPUs

None of the above GPUs are available in a laptop form factor that I am aware of.

Topic		Replies	Views
CUDA on a laptop CUDA Programming and Performance	6	6916	June 30, 2009
double precision on mobile GPU CUDA Programming and Performance	17	7972	October 30, 2011
Double Precision on all new Fermis getting to the bottom of DP performance, esp mobile CUDA Programming and Performance	13	9777	October 11, 2010
Student buying card for CUDA. Which one? CUDA Programming and Performance	16	14861	December 4, 2012
cuda and double precision CUDA Programming and Performance	3	7767	July 23, 2009
Best (Quality/Price) graphics card for heavy scientific computing CUDA Programming and Performance	6	5217	April 23, 2011
Critique my HPC Specification CUDA Programming and Performance	8	1332	June 11, 2015
Double precision and CUDA CUDA Programming and Performance	9	7730	October 21, 2013
Which compute capabiility does nvs 5100m support? CUDA Programming and Performance	11	2512	October 12, 2010
Buying GPUs for CUDA simulations CUDA Programming and Performance	2	812	June 28, 2014

Looking for a laptop to run scientific simulations in CUDA with double precision - speed is important

Related topics