Volatile pointers to vector types (e.g. int4*) No operator for "volatile int4 = int4"

blloyd · July 22, 2011, 8:08pm

Suppose I have the following generic code for performing a prefix-sum across a warp:

int t = threadIdx.x;

    Type Sum;

    __shared__ Type pShared[16+32];

    volatile Type* pScratch = pShared + 16;

    pScratch[t-16] = Type();

    pScratch[t] = Sum = pValue[t];

    pScratch[t] = Sum = Sum + pScratch[t-1];

    pScratch[t] = Sum = Sum + pScratch[t-2];

    pScratch[t] = Sum = Sum + pScratch[t-4];

    pScratch[t] = Sum = Sum + pScratch[t-8];

                  Sum = Sum + pScratch[t-16];

The pointer to shared memory must be volatile, otherwise the compiler won’t actually write the partial sum to memory at each step. This code works fine for primitive types (int, float, etc.). For vector types it is necessary to define the ‘+’ operator. For example, for int2 we have:

__device__ inline

int2 operator+( const int2& a, const int2& b )

{

    return make_int2( a.x + b.x, a.y + b.y );

}

The problem is that when Type is not primitive the compiler complains that there is no operator for ‘volatile Type = Type’. I don’t think it is possible to define operator=() for the vector types without changing the cuda header files, which I don’t want to do. Even if I were to do that, I am not sure what the signature for that operator should look like. Nothing that I have tried has worked.

Does anyone know a clean way to deal with this problem?

akavo · July 24, 2011, 3:36am

Perhaps the real problem is that ‘Sum’ should be declared as volatile.

Topic		Replies	Views
using volatile with vector types like float4 is it possible? CUDA Programming and Performance	1	1844	May 23, 2012
No vector addition operator? getting arror when adding 2 float4 values CUDA Programming and Performance	5	9093	May 24, 2011
Unsynchronized shared memory access CUDA Programming and Performance	5	1040	April 11, 2017
volatile breaks coalescing for vector types volatile trick can backfire. CUDA Programming and Performance	7	2059	November 4, 2009
swizzling? float4 arithmetic support? CUDA Programming and Performance	4	7859	February 20, 2007
error: no operator "=": volatile Struct1= Struct1 volatile CUDA Programming and Performance	2	3726	August 16, 2008
mathematical operation in built-in vector type? CUDA Programming and Performance	9	5665	June 29, 2009
pointer to shared memory compiler problems CUDA Programming and Performance	19	14591	June 7, 2008
volatile in CUDA Fortran Legacy PGI Compilers	5	5842	August 25, 2012
Vector type operations in cuda CUDA Programming and Performance	3	4362	June 3, 2016

Volatile pointers to vector types (e.g. int4*) No operator for "volatile int4 = int4"

Related topics