-deviceemu and C++ code in host functions

Larissa · May 13, 2009, 8:22pm

Hi everyone, I ran into a problem compiling some code with VS2005 and CUDA 2.2 when I use the -deviceemu flag and there is C++ code in the host functions. (I posted about a related issue a few weeks ago at http://forums.nvidia.com/index.php?showtopic=94569&st=0 and have only just been able to upgrade to VS2005 and CUDA 2.2 to test.) I am on WinXP with a Quadro 570, in case that is useful information.

I’ve included some code at the end that produces the error for me.

To Generate Problem:

a) there’s C++ code in the host functions in the .cu files

b) that C++ function is referenced from a .cpp file elsewhere

c) you are compiling with -deviceemu

Result: the linker reports unresolved symbols for functions defined in the .cu file. It looks like the object file built from the .cu file gets different symbol names in device emulation mode than in normal mode, while the rest of the code is compiled the same way in both cases.

Compiling normally (everything links and runs):

dumpbin on kernel.obj:

023 00000000 SECT8 notype () External | ?launch_kernel@@YA_NH@Z (bool __cdecl launch_kernel(int))

Files generated during compile:

kernels.cu

tmpxft_00000edc_00000000-3_kernels.cudafe1.gpu

tmpxft_00000edc_00000000-8_kernels.cudafe2.gpu

tmpxft_00000edc_00000000-3_kernels.cudafe1.cpp

tmpxft_00000edc_00000000-13_kernels.ii

Compiling with -deviceemu flag:

Linker error:

main.obj : error LNK2019: unresolved external symbol "bool __cdecl launch_kernel(int)" (?launch_kernel@@YA_NH@Z) referenced in function _main
test-cuda.exe : fatal error LNK1120: 1 unresolved externals

dumpbin on kernel.obj:

072 00000000 SECT1C notype () External | __Z13launch_kerneli
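
For anyone comparing the two dumpbin outputs: the normal build exports an MSVC-decorated C++ name, whereas the emulation build's symbol looks like an Itanium-ABI (GCC-style) mangled name with the extra leading underscore that 32-bit Windows puts in front of C symbols. That would fit the .cu file having been compiled as C, though this reading is only an inference from the symbol itself. A minimal sketch that rebuilds the name under that assumption (both helper functions are hypothetical and only cover global functions whose parameters are all int):

```cpp
#include <string>

// Hypothetical helper: Itanium-ABI mangled name for a free function in the
// global namespace whose parameters are all `int` (the return type is not
// encoded for ordinary functions).
std::string mangle_int_function(const std::string &name, int n_int_params)
{
    // "_Z" prefix, then the identifier length, then the identifier itself
    std::string out = "_Z" + std::to_string(name.size()) + name;
    if (n_int_params == 0)
        out += "v";              // empty parameter list is encoded as void
    for (int i = 0; i < n_int_params; ++i)
        out += "i";              // each int parameter is encoded as 'i'
    return out;
}

// Hypothetical helper: 32-bit Windows decorates a cdecl C symbol by
// prepending a single underscore.
std::string win32_cdecl_symbol(const std::string &c_name)
{
    return "_" + c_name;
}
```

Calling win32_cdecl_symbol(mangle_int_function("launch_kernel", 1)) yields __Z13launch_kerneli, which matches the symbol in the emulation-mode object file, while main.obj is still looking for ?launch_kernel@@YA_NH@Z.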

Files generated during compile:

kernels.cu

tmpxft_00000f6c_00000000-3_kernels.cudafe1.c

tmpxft_00000f6c_00000000-7_kernels.i

Also, it looks like nvcc translates any C++ host code into C code during preprocessing in emulation mode (turning classes into structs and bools into chars or ints)?

Other Notes:

If I wrap the function declaration in the header file in extern "C" (and get rid of any C++-specific stuff), I can compile with -deviceemu and everything works.

If all the code is contained in the single .cu file, rather than split between a .cu file and a .cpp file, everything works fine
If I run in normal mode, everything appears to work fine.
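
The extern "C" variant from the first note can be sketched roughly as follows. This is an illustration rather than the exact header I used: the int return type replaces bool as one way of dropping the C++-specific parts, and the function body is a hypothetical stand-in so the snippet compiles on its own (the real body stays in kernels.cu).

```cpp
// kernels.h (sketch): declaration with C linkage, so the symbol name is
// the same whether the translation unit is compiled as C or as C++.
#ifdef __cplusplus
extern "C" {
#endif

/* Returns 1 on success, 0 on failure (int instead of bool). */
int launch_kernel(int pixels);

#ifdef __cplusplus
}
#endif

// Stand-in definition for illustration only (assumption: not the real
// kernel launch). It inherits C linkage from the declaration above.
int launch_kernel(int pixels)
{
    return pixels > 0 ? 1 : 0;
}
```

With C linkage no C++ name decoration is applied on either side, so main.cpp and the emulation-mode object file refer to the same symbol.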

Is the solution to only use C code in files that nvcc will be asked to parse?

Any help would be appreciated,

Thanks,

Larissa

----main.cpp----

#include "kernels.h"
#include <stdio.h>

int main()
{
  if (launch_kernel(1024*1024))
    printf("Kernel launched successfully\n");
  else
    printf("Kernel failure\n");
}

----kernels.h----

bool launch_kernel(int pixels);

----kernels.cu----

#include <stdio.h>
#include <stdlib.h>  // malloc, free
#include "kernels.h"

// element-wise multiply: dest[i] = sourcea[i] * sourceb[i]
__global__ void do_something(float *dest, float *sourcea, float *sourceb, int size)
{
  int pixel = blockIdx.x * blockDim.x + threadIdx.x;
  if (pixel < size) {
    dest[pixel] = sourcea[pixel] * sourceb[pixel];
  }
}

bool launch_kernel(int pixels)
{
  int block_size = 128;
  // round up so a partial final block still gets launched
  // (the ternary needs its own parentheses, ?: binds more loosely than +)
  int n_blocks = (pixels / block_size) + ((pixels % block_size) ? 1 : 0);
  int size = sizeof(float) * pixels;
  float *dest;
  float *a;
  float *b;
  float *tmp = (float *)malloc(size);

  // allocate memory
  cudaMalloc((void **)&dest, size);
  cudaMalloc((void **)&a, size);
  cudaMalloc((void **)&b, size);

  // fill memory with generated data for testing purposes only
  for (int i = 0; i < pixels; i++)
    tmp[i] = (float)i * .5f;
  cudaMemcpy(a, tmp, size, cudaMemcpyHostToDevice);
  for (int i = 0; i < pixels; i++)
    tmp[i] = (float)i * (float)i;  // multiply as float, i*i overflows int for large i
  cudaMemcpy(b, tmp, size, cudaMemcpyHostToDevice);

  do_something<<<n_blocks, block_size>>>(dest, a, b, pixels);
  cudaError_t lastError = cudaGetLastError();

  // release device and host buffers
  cudaFree(dest);
  cudaFree(a);
  cudaFree(b);
  free(tmp);

  return lastError == cudaSuccess;
}
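
A side note on the block-count arithmetic in launch_kernel, unrelated to the linker problem: the conditional operator binds more loosely than +, so the round-up term only works if the ternary has its own parentheses. A minimal sketch comparing the two groupings (the function names are hypothetical, chosen just for this comparison):

```cpp
// How "a / b + a % b ? 1 : 0" is parsed when the ternary is not
// parenthesized: the whole sum becomes the condition.
int blocks_as_parsed(int pixels, int block_size)
{
    return ((pixels / block_size) + (pixels % block_size)) ? 1 : 0;
}

// Intended round-up: whole blocks plus one extra block for any remainder.
int blocks_rounded_up(int pixels, int block_size)
{
    return (pixels / block_size) + ((pixels % block_size) ? 1 : 0);
}
```

For 1024*1024 pixels and a block size of 128, the first grouping gives 1 block and the second gives 8192, so only the second covers every pixel.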

tmurray · May 13, 2009, 9:29pm

Works fine on Linux, time to boot into Windows and see what’s wrong with that…

tmurray · May 13, 2009, 11:45pm

argh, Windows is breaking something and compiling this as C for some reason. Don’t entirely know why yet, but I’ve asked the compiler guys to look into it.

Larissa · May 14, 2009, 1:44pm

I’m glad you were able to reproduce it.

Since you said that linux works, I’ll just use that for debugging on our end for the time being. Thanks!

Jarrad · May 14, 2009, 3:45pm

There’s another thread currently discussing the same issue:

http://forums.nvidia.com/index.php?showtopic=96641

You seem to have described the problem in much greater detail than us though ;).

theMarix · May 14, 2009, 10:39pm

– sorry, ignore this post, thought you referred to another thread –

TL1 · June 16, 2009, 10:43am

So is there a workaround for this issue?

tmurray · June 16, 2009, 3:45pm

not in 2.2, no
