Possible bug in cuComplex.h Is typedef of cuFloatComplex correct

MMB · July 1, 2010, 9:07pm

The include file cuComplex.h contains a statement: “typedef complex cuFloatComplex”. The C99 complex type defaults to double precision. Hence, shouldn’t this statement really be: “typedef float complex cuFloatComplex”?

Comments anyone?

MMB

clamport · July 2, 2010, 2:58pm

I believe with devices that don’t support double precision it is automatically truncated to single precision, but don’t quote me on it.
~clamport

MMB · July 4, 2010, 10:07pm

Hi clamport. You might be right about that, but double precision support is here to stay. So, if there is a problem it needs to be fixed!

MMB

MMB · July 8, 2010, 10:13pm

bump.

mfatica · July 9, 2010, 12:20am

The typedef is in a section guarded by

#if (!defined(CUDACC) && defined(CU_USE_NATIVE_COMPLEX))

RIght now, it is dead code.

MMB · July 12, 2010, 2:45pm

Hi mfatica, thanks for the reply. Your thoughts on the following will be appreciated:

We have an algorithm using complex arithmetic that we want to run on

the GPU. As a first step, we implement it using C99 native complex

types, compile it using gcc and test it on the host CPU. The second

step is to convert the “float complex” declarations to

“cuFloatComplex”, and the arithmetic operations themselves to use the

inline functions from the cuComplex.h header (e.g. replace “z = a + b”

with z = cuCaddf(a, B), etc), then recompile using gcc using

“-DCU_USE_NATIVE_COMPLEX”. In principle, this should generate

identical CPU machine code. If we’ve done everything right, it will

run again on the host and get the same answer it did before we made

this transformation. The third and final step, obviously, is to

recompile the code unchanged with nvcc and run it on the GPU.

The second step didn’t work. The reason it didn’t work is because

cuComplex.h contained this line

typedef complex cuFloatComplex;

when it should have had

typedef float complex cuFloatComplex;

Now, you can argue that the header is only supposed to work when

compiling with NVCC, but then why include the conditional for

CU_USE_NATIVE_COMPLEX? If you always have NVCC defined, the

preprocessor conditionals make CU_USE_NATIVE_COMPLEX itself dead code.

It seems to me that the way the header was written was to support

exactly this style of algorithm development: implement and test your

algorithm first on the host using native complex types, then switch to

the GPU when you get it working.

MMB

MMB · July 14, 2010, 3:00pm

Hi mfatica, thanks for the reply. Your thoughts on the following will be appreciated:

We have an algorithm using complex arithmetic that we want to run on

the GPU. As a first step, we implement it using C99 native complex

types, compile it using gcc and test it on the host CPU. The second

step is to convert the “float complex” declarations to

“cuFloatComplex”, and the arithmetic operations themselves to use the

inline functions from the cuComplex.h header (e.g. replace “z = a + b”

with z = cuCaddf(a, B), etc), then recompile using gcc using

“-DCU_USE_NATIVE_COMPLEX”. In principle, this should generate

identical CPU machine code. If we’ve done everything right, it will

run again on the host and get the same answer it did before we made

this transformation. The third and final step, obviously, is to

recompile the code unchanged with nvcc and run it on the GPU.

The second step didn’t work. The reason it didn’t work is because

cuComplex.h contained this line

typedef complex cuFloatComplex;

when it should have had

typedef float complex cuFloatComplex;

Now, you can argue that the header is only supposed to work when

compiling with NVCC, but then why include the conditional for

CU_USE_NATIVE_COMPLEX? If you always have NVCC defined, the

preprocessor conditionals make CU_USE_NATIVE_COMPLEX itself dead code.

It seems to me that the way the header was written was to support

exactly this style of algorithm development: implement and test your

algorithm first on the host using native complex types, then switch to

the GPU when you get it working.

MMB

Bump

MMB · July 21, 2010, 2:24am

bump

Topic		Replies	Views
using cuComplex in device and host code CUDA Programming and Performance	7	18834	January 30, 2011
Problem in cuComplex.h (solution added) the use of math.h instead of cmath? CUDA Programming and Performance	0	3723	June 21, 2011
cufftw.h and C99 complex types GPU-Accelerated Libraries	8	2846	September 18, 2013
math.h, tgmath.h and cuComplex.h nvcc compiler CUDA Programming and Performance	0	1337	October 20, 2008
Complex Numbers CUDA Programming and Performance	4	2753	June 3, 2009
cuComplex data type - where is it defined? CUDA Programming and Performance	3	6957	November 11, 2008
attempt at a CUDA complex maths library working in C++ and CUDA likewise CUDA Programming and Performance	5	71723	July 28, 2011
Problems converting cuda complex to C++ complex CUDA Programming and Performance	0	1713	August 14, 2009
cuComplex just need some general advice CUDA Programming and Performance	1	4779	February 15, 2011
Complex Numbers How best to represent? CUDA Programming and Performance	16	9340	November 27, 2019

Possible bug in cuComplex.h Is typedef of cuFloatComplex correct

Related topics