Alignment Issue?

danuk · April 20, 2011, 6:05pm

Hi,

If I compile this code:

__global__ void test(double d, float f, unsigned int i) { }

int main()
{
	test<<<1, 1>>>(2.0, 2.0f, 2);
	return 0;
}

like this:

nvcc-4.0 --keep main.cu -o main -arch=sm_20

and look at the cudaSetupArgument functions (cleaned up a little), I get this:

cudaSetupArgument(__par0, 0UL);
cudaSetupArgument(__par0, 8UL);
cudaSetupArgument(__par0, 12UL);

however, if I swap the order of the float and double parameters, I get this:

cudaSetupArgument(__par0, 0UL);
cudaSetupArgument(__par0, 8UL);
cudaSetupArgument(__par0, 16UL);

What’s going on here? Why does the float parameter take up 4 bytes when it comes second and 8 when it comes first? Why can’t I just use sizeof(variable) to calculate what these values should be?

Dan

Gregory_Diamos · April 20, 2011, 6:50pm

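All primitive types are aligned to their size. This is a restriction of the GPU hardware (most processors other than x86 have the same restriction, and even x86 takes a performance hit if you don’t do this). So if the double is specified after the float, the float starts at offset 0 and the double cannot start at offset 4, because 4 is not 8-byte aligned; it is padded and placed at offset 8 instead. The float itself still occupies only 4 bytes. The other 4 bytes are padding in front of the double, which is why the unsigned int then lands at 16. That is also why sizeof alone is not enough: each offset has to be rounded up to a multiple of the parameter's alignment.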
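To make the rounding concrete, here is a minimal host-side sketch of the rule described above. It is not what nvcc actually generates; the helper names align_up, param_offset and print_layouts are made up for illustration, and it assumes each primitive type is aligned to its own size, with 8-byte double and 4-byte float and unsigned int.

```cpp
#include <cstddef>
#include <cstdio>

// Round `offset` up to the next multiple of `align`.
// Assumption: `align` is a power of two (true for 4 and 8 here).
constexpr std::size_t align_up(std::size_t offset, std::size_t align) {
    return (offset + align - 1) & ~(align - 1);
}

// Hypothetical helper: offset of a parameter of type T that follows
// `end_of_previous` bytes of earlier parameters. Uses sizeof(T) as the
// alignment, per the "aligned to their size" rule for primitive types.
template <typename T>
constexpr std::size_t param_offset(std::size_t end_of_previous) {
    return align_up(end_of_previous, sizeof(T));
}

// Prints the offsets for both parameter orders discussed in the thread.
void print_layouts() {
    // Order (double, float, unsigned int)
    std::size_t d = param_offset<double>(0);
    std::size_t f = param_offset<float>(d + sizeof(double));
    std::size_t i = param_offset<unsigned int>(f + sizeof(float));
    std::printf("double, float, uint: %zu %zu %zu\n", d, f, i);

    // Order (float, double, unsigned int): the double is padded up to 8
    f = param_offset<float>(0);
    d = param_offset<double>(f + sizeof(float));
    i = param_offset<unsigned int>(d + sizeof(double));
    std::printf("float, double, uint: %zu %zu %zu\n", f, d, i);
}
```

Calling print_layouts() from main should reproduce the two offset sequences from the first post (0, 8, 12 and 0, 8, 16) on a platform with the sizes assumed above. The point is that the next offset is the end of the previous parameter rounded up, not simply a running sum of sizeof values.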

danuk · April 21, 2011, 10:19am

That makes complete sense. I guessed it had something to do with alignment. Thanks.
