Effect of compiling CUDA for an older compute capability

svlugt · April 20, 2020, 12:38pm

Hi,

I’m using a GPU with compute capability 7.0 with CUDA 10, with legacy code that builds for compute capability (CC) 3.5 explicitly (–gpu-architecture compute_35). I’m wondering what the exact effect is of this configuration. As far as I can find documentation on CC only describes the relation as in “is my hardware capable of supporting this feature” but I can’t so much find a description of the behavior for legacy code other than that CUDA 10 still supports down to CC 3.x (however it is the last version to support CC 3.x).

For example, table 15 in the CUDA programming guide denotes that for CC 3.5 the maximum number of resident blocks per multiprocessor is 16, where my device with CC 7.0 has 32 as maximum number of resident blocks per multiprocessor. Will compilation with CC 3.5 result in a maximum of 16 blocks per multiprocessor here and thus simulate or force CC 3.5 behavior on my CC 7.0 device?

What else should I be weary of when porting CUDA code with CC 3.5 to 7.0?

Thank you for your time!

Topic		Replies	Views
Compute Capability for Geforce GTX 740m CUDA Programming and Performance	4	2753	October 9, 2014
compute capability/CUDA Toolkit 3.1 CUDA Programming and Performance	2	9035	July 13, 2010
code generation sm_10 CUDA Programming and Performance	3	761	November 24, 2013
Supporting cards of different compute capabilities in a single executable CUDA Programming and Performance	3	1794	June 10, 2009
cudaMalloc is not working on the gtx 1070 with cuda 8.0 CUDA Programming and Performance	6	1411	October 18, 2016
'compute_10' and 'sm_10' architectures are depreciated Legacy PGI Compilers	1	16782	June 18, 2014
Compute Capability 1.0 faster than 3.5? CUDA Programming and Performance	8	1852	August 9, 2013
GTX 770 and Compute Capability 3.5 CUDA Programming and Performance	13	17363	April 25, 2019
Changing CUDA version CUDA Developer Tools	0	986	December 10, 2020
Run code for mixed compute capability cards CUDA Programming and Performance	1	1529	January 15, 2009

Effect of compiling CUDA for an older compute capability

Related topics