CUDA Block-level Shared Registers

Daniel_Wong · May 23, 2021, 5:52pm

Hi, All

Is there any way that I can use the registers such that are visible for a warp/block of threads to access, just like the shared memory ?

One more assumption is that I already have the size for each warp/block, for example, I need 64xsizeof(float) registers.

The major reason for this is that I found the random access from a warp to shared memory is very slow in the case of uncoalsed access required by the application.

Thanks

Robert_Crovella · May 24, 2021, 12:50am

Topic		Replies	Views
Shared memory and register usage - just 1 thread/block CUDA Programming and Performance	1	801	July 21, 2009
Warp specialize register usage CUDA Programming and Performance	4	962	July 13, 2024
Registers and Shared Memory question CUDA Programming and Performance	7	5457	September 10, 2007
utilize registers CUDA Programming and Performance	2	379	March 27, 2019
Register Usage & Shared Memory How to limit usage properly? CUDA Programming and Performance	1	4850	June 30, 2008
Shared memory using structure instead of array CUDA Programming and Performance	7	1344	February 29, 2020
newbie question shared mem CUDA Programming and Performance	2	810	April 16, 2009
Access to CUDA Shared memory from the host CUDA Programming and Performance	4	1192	December 18, 2018
Register or shared memory? CUDA Programming and Performance	5	4174	July 31, 2009
register allocation behaviour CUDA Programming and Performance	2	426	January 9, 2019

CUDA Block-level Shared Registers

Related topics