Problem with left shift

GiuseppeC · January 21, 2011, 5:12pm

I’ve a problem with the “<<” operator (left shift).

Somewhere in my code there is the following define:

#define ROL64(a, offset) ((offset != 0) ? ((((UINT64)a) << offset) ^ (((UINT64)a) >> (64-offset))) : a)

When I try to compile my program I obtain a very strange error:

### Assertion failure at line 1923 of ../../be/cg/cgemit.cxx:

### Compiler Error in file /tmp/tmpxft_000022e5_00000000-7_kernel.cpp3.i during Assembly phase:

### incorrect register class for operand 1

nvopencc INTERNAL ERROR: /opt/cuda/open64/lib//be returned non-zero status 1

This error disappears if, instead of using the “<<”, I use the “>>” (right shift). I do not understand this behavior. By now I’m using the following “trick”

#define ROL64(a, offset) ((offset != 0) ? ((((UINT64)a) >> (0-offset)) ^ (((UINT64)a) >> (64-offset))) : a)

but I’m not sure this has the expected behavior.

Any Ideas?

SPWorley · January 21, 2011, 6:23pm

I submitted a similar bug to NVIDIA a few weeks ago which is clearly a compiler bug. This is probably the same one. It had to do with a shifts of 64 bit long values when the shift had integer math in it.

It only failed for sm_20 and not sm_13.

As a workaround, just put the shift amount into a temporary variable.

// compile with nvcc -arch sm_20 crash.cu

__device__ unsigned long circularShift(unsigned long v, int shift)

{

    return (v<<(64-shift));

}

int main() {}

njuffa · January 21, 2011, 7:12pm

I recall a compiler bug being reported regarding 64-bit rotates recently. If memory serves, one workaround is to assign the left-shift and right-shift portions to temporary variables, then combine them (with | or + for example) in a separate step. Unfortunately that won’t work for a macro due to the need for temp variables. Could you use a forceinline function instead?

BTW, negative shift counts cause undefined behavior in C/C++, so assuming that offset is >= 0, ((a) >> (0-offset)) will not in general give the desired result.

SPWorley · January 21, 2011, 10:15pm

Just as an aside, since shifts aren’t a common topic usually worth posting for: GPU shifts behave differently than CPU shifts when the shift amount exceeds the word size in bits.

This is undefined behavior in the C standard, and is one of the few cases where NVIDIA GPUs differ in their integer computation from Intel and AMD CPUs.

On the CPUs, a shift amount >= word size will end up using the low order bits of the shift, so a integer shift of 33 is the same as a shift of 1, for example.
On the GPU, the shift value is saturated, so a shift of 32 or more acts like 32.
This is true for right or left shifts.

Again this is not a bug, it’s documented as undefined behavior in C itself. It’s interesting there’s a difference though.

GiuseppeC · January 24, 2011, 5:56pm

Thank you very much for the detailed informations :)

I modified my code in this way:

// NVCC Bug

//#define ROL64(a, offset) ((offset != 0) ? ((((UINT64)a) << offset) ^ (((UINT64)a) >> (64-offset))) : a)

__device__ inline UINT64 ROL64(UINT64 a, unsigned int offset)

{

        const int _offset = offset;

        return ((offset != 0) ? ((a << _offset) ^ (a >> (64-offset))) : a);

}

By now I hope it will be ok, however I have to check the range of the variable “offset” in order to avoid misalignment between the CPU and GPU code.

Topic		Replies	Views
Problem with left shift CUDA Programming and Performance	0	2693	January 21, 2011
Incorrect x86 instruction emitted by Emu nvcc Right-shift is broken! CUDA Programming and Performance	13	6223	August 18, 2008
Compiler Error for Byte To Int Conversion CUDA Programming and Performance	16	1429	May 21, 2011
Switch oddities Compiler bug? CUDA Programming and Performance	16	4552	September 10, 2008
problems reported with __ffs()? CUDA Programming and Performance	9	7597	October 31, 2010
Unexpected result with 64-bit integer left-shift CUDA NVCC Compiler	4	774	April 7, 2022
>=32bit shift on 32bit integer CUDA Programming and Performance cuda	5	1131	October 12, 2021
Right/left shift operations for int4 CUDA Programming and Performance	0	1498	March 12, 2011
compiler bug? bit shift CUDA Programming and Performance	7	4519	November 28, 2008
Problem with Shift Left Can anyone help? CUDA Programming and Performance	4	4636	May 27, 2009

Problem with left shift

Related topics