I have been working on a CUDA app to speed up some video processing routines. I recently decided to split one of the longer .cu files into multiple .cu files (it had two kernels in it). Both kernels use some constant memory as a transformation matrix, but I can’t seem to make the constant memory work across more than one .cu file.
This is what I have tried:
extern __device__ __constant__ float pix_transform; in the kernel .cu files
__device__ __constant__ float pix_transform; in the driver.cu file.
Strangely enough, when stepping through the program,
checkCUDAError(cudaMemcpyToSymbol("pix_transform", pixel_m, sizeof(pix_transform), 0, cudaMemcpyHostToDevice));
returns an invalid device symbol error.
However, when I remove the extern constant declaration from all the other files, it works fine. I’m kind of at a loss, because constant memory doesn’t seem to be particularly well documented; I only got it working in the first place through trial and error and by looking at other people’s code.
So my question is: is there a way to do this? To share constant memory between kernels that live in different files?
Thanks in advance
There is no linker for device code, so things like texture declarations, constant memory declarations and global memory symbols have file scope only. You can’t directly refer to a device symbol declared in one file from another. The usual workaround is to write “access” host stub functions which can be called from anywhere and which wrap things like texture binding and device memory symbol access. It isn’t always pretty, but it usually suffices.
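The stub pattern described above might look something like this (a sketch with hypothetical file and function names; the key point is that the `__constant__` symbol and everything that touches it live in one translation unit):

```cuda
// transform.cu -- the one file that owns the constant symbol
#include <cuda_runtime.h>

__constant__ float pix_transform[9];  // file-scope device symbol

// Host "access stub": other .cu/.cpp files call this through an
// ordinary C declaration, so they never see the symbol itself.
void set_pix_transform(const float *host_matrix)
{
    cudaMemcpyToSymbol(pix_transform, host_matrix, 9 * sizeof(float));
}

// Kernels that read pix_transform must also live in this file,
// so expose their launches through host wrappers too.
__global__ void warp_kernel(const float *in, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = pix_transform[0] * in[i];  // toy use of the matrix
}

void launch_warp(const float *d_in, float *d_out, int n)
{
    warp_kernel<<<(n + 255) / 256, 256>>>(d_in, d_out, n);
}
```

From any other file you then only declare `void set_pix_transform(const float *);` and `void launch_warp(const float *, float *, int);` — no device symbols ever cross the file boundary.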
CUDA doesn’t have a linker on the device side, so it can’t support extern. You could probably get by with having two pointers, one in each .cu file, and initializing them to the same allocation. Then again, for just 9 floats this does not really seem worth it.
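If the constant cache isn’t essential, the shared-allocation idea can be sketched like this (hypothetical names; the matrix becomes an ordinary global-memory buffer passed to each kernel as an argument, which works from any file):

```cuda
#include <cuda_runtime.h>

// file_a.cu -- a kernel in one file taking the matrix as a parameter
__global__ void kernel_a(const float *transform, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] *= transform[0];  // plain global-memory read (uncached on sm_1x)
}

// host code, in any file: one allocation shared by all kernels
void run(const float *host_matrix, float *d_out, int n)
{
    float *d_transform;
    cudaMalloc((void **)&d_transform, 9 * sizeof(float));
    cudaMemcpy(d_transform, host_matrix, 9 * sizeof(float),
               cudaMemcpyHostToDevice);
    kernel_a<<<(n + 255) / 256, 256>>>(d_transform, d_out, n);
    // a kernel_b in another file can take the same d_transform pointer
    cudaFree(d_transform);
}
```

The trade-off is that reads go through global memory rather than the constant cache, which is why the reply above questions whether it is worth it for 9 floats.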
Ahh okay. Didn’t know that, thank you!
I actually have a bunch of constants: around three 3x3/4x4 transformation matrices, two 2x1 constant matrices, and one 5x1 constant matrix. I’m working on a Quadro 4600, so I have to comply with compute 1.0, and I’m a bit cramped for shared mem. What do you think would be my best solution for a situation like this? I think I could potentially fit it all in the parameters of the kernel, but that seems kind of messy, and I might have to recalculate some more stuff per kernel.
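For what it’s worth, one way to keep the kernel-parameter route tidy (a sketch with a made-up struct layout) is to bundle all the small matrices into a single struct passed by value. On compute 1.x, kernel arguments are delivered through shared memory and are limited to 256 bytes total; the 3x3 layout below comes to 144 bytes, so it fits:

```cuda
#include <cuda_runtime.h>

// Hypothetical bundle of the constants described above.
struct TransformSet {
    float m[3][9];   // three 3x3 transformation matrices
    float v2[2][2];  // two 2x1 vectors
    float v5[5];     // one 5x1 vector
};  // 144 bytes: within the 256-byte sm_1x kernel-parameter limit

__global__ void process(TransformSet t, const float *in, float *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = t.m[0][0] * in[i] + t.v5[0];  // toy use of the constants
}

// host side: fill the struct once and pass it by value
// TransformSet host_t = { /* ... */ };
// process<<<grid, block>>>(host_t, d_in, d_out, n);
```

This avoids a long flat argument list, and since the parameters sit in shared memory on compute 1.0 it does eat into the 16 KB per block, but only by the size of the struct.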
Thanks once again