Any advice on adjusting code for Maxwell when coming from Kepler

njuffa · November 6, 2014, 6:45am

Looks like we are getting back straight to the integer multiplies which are emulated on Maxwell. You would definitely want to look at the SASS to see how many additional instructions that produces on Maxwell. The expansion may create more additional instructions than can be absorbed by the higher instruction throughput on Maxwell, leading to overall slowdown.

As I recall, PTX offers about 25 different flavors of IMUL and IMAD, I think it is possible that some of the emulation sequences may not be fully optimized, in which case you may want to consider filing a bug.

If Maxwell follows the precedent set by previous GPUs, the 64-bit integer conversion instructions are handled by the double-precision execution pipe as that is the only execution path that can consume and deliver 64-bit data. I think the DP ratio of Maxwell consumer parts is lower than the DP ratio of Kepler consumer parts? If so, that could also play a role.

Topic		Replies	Views
What's new in Maxwell 'sm_52' (GTX 9xx) ? CUDA Programming and Performance	69	26902	December 23, 2014
So what's new about Maxwell? CUDA Programming and Performance	166	55889	March 10, 2015
Unofficial Kepler Slides from Random Gamer Site Yeah, yeah, but we only have another week to rumor-m CUDA Programming and Performance	63	10327	April 5, 2012
Technical questions on GTX1080ti multiplication CUDA Programming and Performance	14	1893	November 11, 2017
Speedy general reduction sum code ( ~88.5 % of peak ) Updated for Kepler! __shfl() .... etc,. CUDA Programming and Performance	53	14907	March 24, 2018
Cuda 3.5 Integer Multiply Performance Is it really 3x slower than 64-bit floating point? CUDA Programming and Performance	21	19879	March 12, 2014
Maxwell suddernly becomes 10x slower CUDA Programming and Performance	15	4564	February 24, 2016
Cuda program results are always zero in HW, correct in EMU? CUDA Programming and Performance	35	11117	May 23, 2010
Forward looking GPU integer performance CUDA Programming and Performance	22	21331	March 20, 2017
Kepler and Maxwell, oh my! CUDA Programming and Performance	55	55755	October 19, 2010

Any advice on adjusting code for Maxwell when coming from Kepler

Related topics