Is there a way to hint regarding an input pointer alignment property to nvcc?

laughingrice · November 16, 2011, 6:02pm

I have a problem when loading uchar4 for example from a char *, something like:

char * buf;
uchar4 val = *(uchar4 *)&buf[threadIdx.x];

It seems that CUDA won’t assume that buf is aligned to 4 bytes and breaks the load down to 4 different loads, each one of one byte. The only way to fix this is to load as an integer, but it doesn’t work with larger sizes (such as int2 for example).

Is there a way explicitly tell the compiler that a give pointer is assured to be aligned to some boundary?

Interestingly it seems that OpenCL does assume that (probably as with OpenCL you have to pass a clmem object rather than a pointer).

Thanks

bunnyFair · November 19, 2011, 4:14pm

there is an align(x) qualifier, x = 8 or 16 in the docs (guess it’ll take a 4), which is used on structs. so if you wrapped your array in a struct with that qualifier, maybe that would do it. e.g. struct align(4){char *buf;} mystruct. Something like that.

I would also try, more simply,
char[4] *buf;
uchar4 val = (uchar4) buf[threadIdx.x];

Sorry, my C is pretty rusty… worth a try maybe.

Topic		Replies	Views
Char to uint32_t Pointer recasting Oddity Single thread exhibits different behavior then others CUDA Programming and Performance	3	2884	February 12, 2012
Device emulator alignment bug CUDA Programming and Performance	2	1922	July 18, 2007
nvcc fatal : Unknown option 'fno-strict-aliasing' CUDA Programming and Performance	4	2139	October 8, 2013
Passing a struct to CUDA kernel as parameter - 'align' specifier needed ? CUDA Programming and Performance	5	3193	October 12, 2016
Alignment requirements, shared memory CUDA Programming and Performance cuda	11	892	September 2, 2024
cuda unify and memory alignement for CPU CUDA Programming and Performance	2	1129	November 21, 2016
Vector load "int4 veca1 = reinterpret_cast<int4*>(&a[2])[0];" valid? CUDA Programming and Performance	5	1024	February 17, 2020
use constant memory to pass kernel parameters as struct CUDA Programming and Performance	4	43959	April 30, 2011
Question about cuda-memcheck manual CUDA Programming and Performance	2	525	July 12, 2011
Help with Built in Vector Types Section B.3.1 CUDA Guide 2.2 CUDA Programming and Performance	0	2621	July 6, 2009

Is there a way to hint regarding an input pointer alignment property to nvcc?

Related topics