Using cudaMemcpyToSymbol for constants declared in a separate file

martinkuhnel · July 20, 2022, 7:22pm

Hello! First time posting in this forum so let me know if I am missing anything. I have been trying to find a solution to this issue and haven’t found a match online.

I currently have two files with kernels (kernel1.cu and kernel2.cu) and may add more. The kernels are both called from another cuda file (manager.cu) which handles all of the device allocation and copy in/out. Also, both kernel files share constants, so they both include another file (constants.cuh). However, some of these constants are not known until runtime so I was hoping to use cudaMemcpyToSymbol to define them from the manager. Here is a simplified layout of the code:

constants.cuh

#pragma once
__device__ __constant__ int testconst;

kernel1.cu

#include "kernel1.h"
#include "constants.cuh"

__global__ void testkernel() {
printf("%d", testconst);
}

manager.cu

#include "manager.h"
#include "kernel1.cu"
#include "constants.cuh"

void testwrapper() {
int hostconst = 1;
cudaMemcpyToSymbol(testconst, &hostconst, sizeof(int));
testkernel<<<1,1>>>();
}

The manager.h and kernel1.h files simply declare the testwrapper and testkernel functions.
When this testwrapper is run, the testconst does not get defined and the printf outputs 0. If I define testconst directly in the constants file, it is read back correctly. It also works if all of these are combined into one file, however then kernel2 cannot reuse this. Any ideas for how to get the copy to work in this sort of layout?

Thank you for your help!

njuffa · July 20, 2022, 8:18pm

I am confused. There does not seem to be a kernel2 in the posted code? What is manager.h? Is it needed to reproduce your observations?

It would also be useful if you could share the exact command lines used to invoke nvcc (and the linker, if that is run as a separate step). When you run a failing case under control of compute-sanitizer, are any errors reported?

Robert_Crovella · July 20, 2022, 8:44pm

As indicated by njuffa, I consider it good practice to provide complete test cases when asking questions here. The simpler the better.

When using a global symbol in more than one compilation unit in C or C++, you should go through a set of thought processes to make sure your usage is correct.

A __device__ symbol defined in one compilation unit is not the same as the exact same __device__ symbol defined in another compilation unit, unless you take explicit steps to make it so. This will involve:

proper use of the C++ extern keyword
in the case of CUDA __device__ (or, equivalently, __constant__), proper use of relocatable device code with device linking

this and this may be relevant

martinkuhnel · July 21, 2022, 9:20pm

Thank you for the help and the feedback! My apologies for not including the rest of the files. What Robert provided did solve my problem after going through the resources and applying it to my situation. For those with a similar problem it came down to:

Using extern in the constant file
Defining the constant in the manager.cu file
Using the -rdc=true flag on the nvcc compile and, since I am using cmake, setting the CUDA_SEPARABLE_COMPILATION property to ON

Thank you again!

Topic		Replies	Views
BUG: Linking together multiple .cu files and using the same __constant__ symbols CUDA Programming and Performance	3	7987	November 17, 2009
Constant variabiles scope CUDA Programming and Performance	5	1773	June 9, 2009
Constant Memory and File Organization CUDA Programming and Performance	4	5385	May 12, 2010
Device constant memory from shared object CUDA Programming and Performance	1	6416	June 9, 2009
C# and cudaMemcpyToSymbol CUDA Programming and Performance	2	3398	December 23, 2008
CUDA constant variables not loading properly in kernels CUDA Programming and Performance	2	1483	January 30, 2016
cudaMemcpyToSymbol do only once at startup? CUDA Programming and Performance	1	2944	June 10, 2008
__constant__ stopped working CUDA Programming and Performance	20	4505	October 17, 2009
Problem with __constant__ can't use same name for __constant__ in different files CUDA Programming and Performance	1	2767	October 26, 2013
__constant__ memory which is device-side only (avoiding cudaMemcpyToSymbol) CUDA Programming and Performance	9	9286	June 15, 2017

Using cudaMemcpyToSymbol for constants declared in a separate file

Related topics