Better control of register use

XenoByte · June 30, 2009, 10:20pm

I haven’t been able to find specific information on how I can get better control over the use of registers in my kernels without resorting to PTX programming. Is this correct, is PTX the only way to really accomplish this? Are there possibly some other techniques that can be employed in C code that will force the compiler to re-use registers, for example? I’ve had some success freeing registers by moving variables to shared memory arrays but this seems to slow down kernel execution. I found some information on the use of volatile variables with the claim that this can help PTX to be more parsimonious with registers but the programming guide indicates that volatile is for global or shared memory and makes no mention of it’s use with automatic variables which typically end up in registers.

I’m sure this information must be out there but I’ve searched the forum with various phrases using the word “registers” with no joy.

Thanks to anyone who can provide a pointer.

 - Richard

Nico · June 30, 2009, 10:28pm

I haven’t been able to find specific information on how I can get better control over the use of registers in my kernels without resorting to PTX programming. Is this correct, is PTX the only way to really accomplish this? Are there possibly some other techniques that can be employed in C code that will force the compiler to re-use registers, for example? I’ve had some success freeing registers by moving variables to shared memory arrays but this seems to slow down kernel execution. I found some information on the use of volatile variables with the claim that this can help PTX to be more parsimonious with registers but the programming guide indicates that volatile is for global or shared memory and makes no mention of it’s use with automatic variables which typically end up in registers.

I’m sure this information must be out there but I’ve searched the forum with various phrases using the word “registers” with no joy.

Thanks to anyone who can provide a pointer.

Richard

You can always try passing the -maxrregcount option to nvcc to reduce register usage.

If you’re moving variables to registers you should also try to reduce bank conflicts as much as possible.

N.

XenoByte · July 1, 2009, 1:45am

I’m already using --maxregcount set to 32. I can look at the PTX and see that virtually none of the registers are being re-used and this seems to me to be the major problem. Even when only a conversion is being done the compiler is taking up a register and not re-using it, and whatever I do to create intermediate automatic variables and re-use them seems to have little or no effect.

R

jma · July 1, 2009, 4:52am

Google: ptx, register, reuse

PTX is an intermediate language, not the final assembly output. Use decuda to verify your assumption. Consensus here, so far, has been that register reuse is done in the final stage of translating the PTX code to native machine instructions.

http://forums.nvidia.com/index.php?showtopic=89573

XenoByte · July 1, 2009, 12:11pm

jma, thanks so much for this link. I think I’ll start using Google to search rather than the search engine in this forum. BTW, I haven’t been able to get decuda to run with G10 (it was created for G8 and G9) and the last update to decuda is quite old. Is there a new version of decuda that is not listed on the main decuda page?

R

Topic		Replies	Views
Getting nvcc to consolidate registers CUDA Programming and Performance	19	19673	November 19, 2012
Why would recycling registers increase register count? CUDA Programming and Performance	1	614	September 10, 2018
reducing the number of used registers CUDA Programming and Performance	8	6432	September 22, 2009
how to reduce registers in each kernel CUDA Programming and Performance	2	1183	November 4, 2009
Register usage How good is the compiler? CUDA Programming and Performance	6	3168	April 3, 2008
Freeing of temporary registers CUDA Programming and Performance	5	4453	May 21, 2007
Register usage CUDA Programming and Performance	4	1155	March 13, 2012
register usage according to the ptx file CUDA Programming and Performance	3	4321	June 26, 2009
Use of register An odd problem CUDA Programming and Performance	12	2447	August 12, 2010
Weird use of registers Too many registers are wasted CUDA Programming and Performance	8	5587	July 4, 2007

Better control of register use

Related topics