Performance penalty of using threadIdx.x

Mandar_Gurav · December 26, 2012, 6:28pm

What is the performance penalty of using threadIdx.x and other similar system keywords/values. Consider a scenario where I am using this threadIdx.x for a considerable number of times. What is the better option -

Declare a variable and copy the value of the threadIdx.x and use this variable everywhere. OR
Use threadIdx.x as it is everywhere.

I am not sure if somebody has already posted such query already apologies if so.

Thank you.

– Mandar Gurav

pasoleatis · December 26, 2012, 8:59pm

define in kernel:

int myid=threadIdx.x;

this means that myid is put in registers, which means that it can be read instantaneous and is local to each thread.

seibert · December 27, 2012, 12:03am

It doesn’t matter how you access threadIdx.x in your code. The optimizing compiler will almost certainly do the smart thing, so write your code in a way that is easy to read.

In case you are curious what happens at a low level:

When you access threadIdx.x in your C code, nvcc compiles that to an access to %tid.x, which is a "special register."
Depending on what you do with threadIdx.x, a type conversion (a "cvt" instruction) might be required, but the compiler is smart enough to not do it twice if possible.
There is basically no correlation between hardware registers and C variables in any modern compiler. You can affect things somewhat using keywords like "volatile", but it is usually counterproductive.

Topic		Replies	Views
How fast is threadIdx.x? CUDA Programming and Performance	5	1737	April 11, 2011
How fast to access "threadIdx" ? CUDA Programming and Performance	2	5734	January 29, 2008
address evaluation threadIdx,blockDim treated as constants? CUDA Programming and Performance	17	15972	May 20, 2008
Where is threadIdx stored? NVIDIA Technology Question CUDA Programming and Performance	2	3796	December 1, 2010
using same threadIdx for different variables CUDA Programming and Performance	3	1945	May 8, 2012
Built-in Variables Memory Location ? in which memory are built in variables stored CUDA Programming and Performance	3	5892	September 9, 2011
error in using threadIdx.x as integer CUDA Programming and Performance	6	1376	August 11, 2010
The first inruction in cuda CUDA Programming and Performance	2	2966	October 16, 2008
Cost of accessing built-in variables CUDA Programming and Performance	3	1276	August 2, 2009
1D/2D indexes usage in a kernel CUDA Programming and Performance	3	874	January 31, 2011

Performance penalty of using threadIdx.x

Related topics