Linking cuda header to host code

Shagrat · April 27, 2011, 5:55pm

Hi, I have copied a small example from the dev guide (parallel vector addition), and I have a problem linking the files together.

Basically, I have three files: main.c, add.h and add.cu. main.c initializes the program, add.cu contains the code that runs the parallel addition on the device, and add.h is the header for add.cu. When I include the header in main.c (in order to run the addition), I get a linking error:

When I rename main.c to main.cu, everything works perfectly. What am I missing?

Main.c:

#include <stdio.h>
#include "add.h"

int main() {
	int i;
	int A[4] = { 2, 4, 5, 6 };
	int B[4] = { 1, 2, 3, 4 };
	int C[4];

	VecAdd(A, B, C);

	printf("C = {");
	for(i = 0; i < 4; i++){
		printf("%d,", C[i]);
	}
	printf("}\n");

	return 0;
}

Add.h:

#ifndef ADD_H_
#define ADD_H_

void VecAdd(int *A, int *B, int *C);

#endif

Add.cu:

__global__
void VecAdd_Kernel(int *A, int *B, int *C){
	int i = threadIdx.x;
	C[i] = A[i] + B[i];
}

void VecAdd(int *A, int *B, int *C){
	size_t intSize = sizeof(int);
	int *d_A;
	int *d_B;
	int *d_C;

	cudaMalloc(&d_A, 4*intSize);
	cudaMalloc(&d_B, 4*intSize);
	cudaMalloc(&d_C, 4*intSize);

	cudaMemcpy(d_A, A, 4*intSize, cudaMemcpyHostToDevice);
	cudaMemcpy(d_B, B, 4*intSize, cudaMemcpyHostToDevice);

	VecAdd_Kernel<<<1, 4>>>(d_A, d_B, d_C);

	cudaMemcpy(C, d_C, 4*intSize, cudaMemcpyDeviceToHost);
}

I am on Win 7 32, using Eclipse CDT hacked to use nvcc as a compiler and MS link as linker.

njuffa · April 27, 2011, 6:21pm

.cu files are processed through a C++ frontend, which causes the function name to be decorated. The decorated name then does not match the plain C function name that main.c refers to. The following adjustment should fix this. In Add.h, use

extern "C" void VecAdd(int *A, int *B, int *C);

then add

#include "Add.h"

at the top of Add.cu. That way the compiler is instructed to generate an undecorated, plain C symbol for VecAdd(). Note that this issue is not specific to CUDA; it is a generic issue that affects all mixed C++ and C builds.

Shagrat · April 27, 2011, 6:32pm

I had already tried that before, but when I add extern "C", the compiler now returns a syntax error:

Add.h:

#ifndef ADD_H_
#define ADD_H_

extern "C" void VecAdd(int *A, int *B, int *C);

#endif

Shagrat · April 27, 2011, 6:54pm

I resolved the problem by wrapping the extern directive in #ifdef:

#ifdef __cplusplus
extern "C"
#endif
void VecAdd(int *A, int *B, int *C);

Now it compiles OK.
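For reference, here is a sketch of how the whole header might look with the guard written in block form, which scales better if more functions are added later. The header part follows the thread's add.h. The VecAdd body underneath is a CPU-only stand-in that is not from the thread; it is there only so the sketch compiles on its own with a plain C compiler. In the actual project the definition would stay in add.cu, which should include the same header so the C++ frontend sees the extern "C" declaration.

```c
#include <assert.h>

/* ---- add.h: readable by the C compiler (main.c) and by the
   C++ frontend that nvcc uses for add.cu ---- */
#ifndef ADD_H_
#define ADD_H_

#ifdef __cplusplus
extern "C" {   /* seen only by C++ compilers, which define __cplusplus */
#endif

void VecAdd(int *A, int *B, int *C);

#ifdef __cplusplus
}
#endif

#endif /* ADD_H_ */

/* ---- Stand-in definition (assumption, not from the thread) ----
   Adds 4 elements on the CPU so this sketch builds without nvcc.
   The real definition lives in add.cu and launches VecAdd_Kernel. */
void VecAdd(int *A, int *B, int *C) {
    int i;
    assert(A && B && C);  /* reject null pointers in debug builds */
    for (i = 0; i < 4; i++) {
        C[i] = A[i] + B[i];
    }
}
```

A C compiler never sees the extern "C" lines, so main.c parses the header as an ordinary prototype, while a C++ compiler applies C linkage and emits the undecorated symbol that main.c expects.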
