global variables

allvano · December 11, 2007, 7:58pm

Hello guys,

just a bit confused about global variables usage and declaration :

[codebox]

int g_array;

__global kernel()

{

int idx = … ;

g_array[idx];

}

[/codebox]

The array “g_array” is just and only used from the kernel. Do i still need to use the cudaGetSymbolAddress() and then cudaMalloc() to allocate some memory? Or

is it allowed to allocate some mem directly form the kernel ?

thanks,

jj

zl25drexel · December 11, 2007, 10:36pm

you dont if you mark the array as global or shared

allvano · December 12, 2007, 10:39am

hmm, I’m marked them with device and it compiles but i receiving some strange results during runtime.

In emu mode things are working well, but not on hardware.

shared can not be used because the array is too big. global for variables ?

It is not specified in the documentation.

That was the reason why a asked if i need always use cudaGetSymbolAddress/cudaMalloc on hostside to allocate memory.

thanks,

jj

MisterAnderson42 · December 12, 2007, 12:50pm

You don’t need to always allocate memory with cudaMalloc (though it is certainly the most straightforward way to manage it). But an array declared device cannot be accessed on the host without cudaGetSymbolAddress and a cudaMemcpy. Are you just accessing g_array in the host code? This is likely the cause of your crash because it would dereference an invalid pointer. In emulation, “device” arrays are actually on the host, so it works without any warnings.

allvano · December 12, 2007, 1:18pm

Thank you for the reply.

To access the device g_array from the host will not work because the g_array is

not in the same memory space. This is clear.

The device g_array is only used by device functions to store some

temporary data in global memory. shared can not be used because of the data

size (around 250 Kb/Thread).

Because my code is working well in emu mode and partially on the hardware I was not sure if I understand the scope of global device variables well.

Maybe one another note. When I’m accessing the g_array using a fix number instead of a variable (like g_array[0] = 0;) things are working well. The index variable is declared device as well.

thank you,

jj

MisterAnderson42 · December 12, 2007, 4:08pm

Ok, it seems like you are doing everything correctly then. Perhaps the best thing you can do at this point is to create a minimal test case file that reproduces your problem (preferably one that can be directly compiled with nvcc -o exec file.cu) and post it here. There has to be some little detail you missed somewhere.

Topic		Replies	Views
ALLOCATING MEMORY IN KERNEL v FROM HOST CUDA Programming and Performance	5	4001	June 27, 2008
global arrays that can be used in different kernel calls CUDA Programming and Performance	1	3709	December 16, 2010
global device variables CUDA Programming and Performance	1	3401	June 25, 2007
Array in shared memory from kernel Create array in shared memory from kerne CUDA Programming and Performance	2	3386	May 12, 2008
device memory declared Globally not passed in CUDA Programming and Performance	1	1326	March 31, 2011
__device__ variables and arrays CUDA Programming and Performance	8	15571	August 16, 2014
how to use global device struct variables in device functions CUDA Programming and Performance	4	9304	May 19, 2011
__device__ array to __global__ Cant pass a __device__ array to __global__ CUDA Programming and Performance	3	2354	February 24, 2012
dynamically global memory allocation in __global__ or __device__ function? CUDA Programming and Performance	2	6359	November 17, 2009
accessing __device__ global variables CUDA Programming and Performance	2	1501	July 28, 2008

global variables

Related topics