The topic title says it all: if a variable is declared with the register keyword, can I be sure that CUDA really places it in a register and not in one of the various memory spaces? So far, all I can tell is that nvcc accepts “register float x” without complaint :)
A related question: if a kernel contains a plain declaration like “float x”, which memory space does the variable x end up in?
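
To make the question concrete, here is a minimal sketch of the two declarations I mean. The kernel name scale, the host helper run_demo and the sizes are made up purely for illustration. As far as I know, compiling with nvcc -Xptxas -v prints the per-kernel register count and any local memory usage, which might be one way to check where the variables actually land. Also note that newer C++ standards deprecate and then remove the register keyword, so a recent toolchain may warn about it or need an older -std setting.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel, only here to show the two declarations in question.
__global__ void scale(const float *in, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        register float x = in[i];  // explicit 'register' storage class
        float y = 2.0f * x;        // plain automatic variable
        out[i] = y;
    }
}

// Hypothetical host helper: runs the kernel and returns the number of
// mismatching elements, or -1 if a CUDA call reported an error.
int run_demo()
{
    const int n = 256;
    float h_in[n], h_out[n];
    for (int i = 0; i < n; ++i) h_in[i] = (float)i;

    float *d_in = nullptr, *d_out = nullptr;
    if (cudaMalloc(&d_in, n * sizeof(float)) != cudaSuccess) return -1;
    if (cudaMalloc(&d_out, n * sizeof(float)) != cudaSuccess) {
        cudaFree(d_in);
        return -1;
    }

    cudaMemcpy(d_in, h_in, n * sizeof(float), cudaMemcpyHostToDevice);
    scale<<<(n + 127) / 128, 128>>>(d_in, d_out, n);
    cudaError_t err = cudaDeviceSynchronize();  // wait for the kernel, catch errors
    if (err == cudaSuccess)
        err = cudaMemcpy(h_out, d_out, n * sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(d_in);
    cudaFree(d_out);
    if (err != cudaSuccess) return -1;

    int mismatches = 0;
    for (int i = 0; i < n; ++i)
        if (h_out[i] != 2.0f * h_in[i]) ++mismatches;
    return mismatches;
}

int main()
{
    printf("mismatches: %d\n", run_demo());
    return 0;
}
```

Both x and y behave the same from the program's point of view, so the result alone says nothing about where they are stored; that is exactly why I am asking what the compiler does with each of them.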