Does the GTX1060 support double precision?

23022017 · February 23, 2017, 10:12pm

Does the GTX 1060 support double precision? What is its double-precision throughput in GFLOPS? I couldn’t find any information on this on the NVIDIA website. Is there a table that details the double-precision capabilities of NVIDIA GPUs?

The only source of information I found is external, on Wikipedia. However, the information on Wikipedia may not be correct.

I intend to use the GTX 1060 for scientific computations that require double precision.

Robert_Crovella · February 23, 2017, 10:29pm

The double precision throughput is 1/32 the single precision throughput.
This is derived from the compute capability 6.1 entries in this table:

Programming Guide :: CUDA Toolkit Documentation

(4 FP64 results per clock per multiprocessor versus 128 FP32 results: 4/128 = 1/32)

According to this particular example:

NVIDIA GTX 1060 Review & Benchmark vs. RX 480 (Ft. MSI Gaming X) | GamersNexus - Gaming PC Builds & Hardware Benchmarks

The FP32 throughput is ~6.5 TFLOPS, so the DP (FP64) throughput should be around 0.2 TFLOPS, or 200 GFLOPS.

It will vary somewhat with the actual clock, which depends on the board you have and on boost activity.
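The arithmetic above can be sketched in a few lines of Python. This is only an illustration: the function name `fp64_peak_gflops` is made up for the example, the 4 and 128 are the per-multiprocessor rates quoted above for compute capability 6.1, and the 6500 GFLOPS input is simply the FP32 figure cited in this post, not a measured value.

```python
def fp64_peak_gflops(fp32_peak_gflops, fp64_per_sm=4, fp32_per_sm=128):
    """Estimate peak FP64 throughput from peak FP32 throughput.

    The defaults are the per-clock, per-multiprocessor rates quoted in
    the thread for compute capability 6.1 (4 FP64 vs. 128 FP32).
    """
    return fp32_peak_gflops * fp64_per_sm / fp32_per_sm


# FP32 figure as quoted in the thread (~6.5 TFLOPS = 6500 GFLOPS).
estimate = fp64_peak_gflops(6500.0)
print(f"Estimated FP64 peak: {estimate:.0f} GFLOPS")
```

Plugging in a different FP32 figure (for example one derived from your own board's boost clock) scales the estimate linearly.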

I think the Wikipedia info is pretty accurate also.

23022017 · February 23, 2017, 10:51pm

Thanks for the input.

In the article

it is written “Like the other GTX chips, GP106 dedicates itself to FP32 single precision compute, leaving double precision FP64 to CUDA Cores science-class GPUs.”

This would mean that there is no double precision computation with the GTX 1060.

Robert_Crovella · February 23, 2017, 11:01pm

No, that wouldn’t be the correct interpretation. The idea is that, since FP64 throughput is 1/32 of FP32 throughput (i.e. much smaller), people who are interested in high levels of FP64 performance should probably consider “science class GPUs”, by which they mean various members of the Tesla family of GPUs.

njuffa · February 24, 2017, 12:11am

The GFLOPS information for the GTX 1060 in this Wikipedia table seems correct to me:
[url]https://en.wikipedia.org/wiki/GeForce_10_series[/url]

In many cases the performance differences between single-precision and double-precision computation are not nearly as severe as the raw throughput ratios would suggest:

(1) On GPUs, it is generally not possible to get more than about 75% of theoretical single-precision peak performance out of compiled code, e.g. due to scheduling and register bank conflicts. But on DP-lite consumer GPUs, it is possible to get 99% of theoretical double-precision performance when that becomes the most severe bottleneck.

(2) Most real-life codes, even when classified as floating-point intensive, execute many non-floating-point operations that aren’t affected by the disparities between SP and DP throughput.
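As a rough illustration of how points (1) and (2) combine, here is a small Python sketch of a toy model. Everything in it is an assumption made for the example: the function name `effective_slowdown`, the idea of splitting run time into a floating-point-bound fraction and a remainder that is unaffected by precision, and the sample fractions. Only the 1/32 ratio and the 75% and 99% efficiencies come from the thread. It ignores other effects, such as the extra memory traffic of 64-bit data.

```python
def effective_slowdown(fp_fraction, raw_ratio=32.0,
                       sp_efficiency=0.75, dp_efficiency=0.99):
    """Toy estimate of how much slower a code runs in DP than in SP.

    fp_fraction: hypothetical share of the SP run time that is bound by
                 floating-point throughput (0.0 to 1.0).
    raw_ratio:   peak SP throughput divided by peak DP throughput.
    """
    # Point (1): SP reaches ~75% of its peak, DP ~99% of its peak,
    # so the achieved ratio is smaller than the raw peak ratio.
    fp_slowdown = raw_ratio * sp_efficiency / dp_efficiency
    # Point (2): the non-floating-point part takes the same time.
    return (1.0 - fp_fraction) + fp_fraction * fp_slowdown


# Hypothetical workload mixes, for illustration only.
for fraction in (0.25, 0.5, 1.0):
    print(f"FP-bound fraction {fraction:.2f}: "
          f"~{effective_slowdown(fraction):.1f}x slower in DP")
```

Even with every cycle bound by floating-point throughput, the modeled slowdown stays below the raw factor of 32, and it shrinks further as the floating-point-bound share drops.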
