malloc of one kernel in another kernel Memory allocated in one kernel can be accessed in another ker

sanf · January 22, 2012, 7:08am

Is it possible to access memory, which is allocated in one kernel in another kernel function.

for example: some thing like this:

__global__ void testmalloc()

{

float *p1, *p2;

p1 = malloc(10*sizeof(float));

}

__global__ void testm()

{

int i;

for(i=0; i<10; i++)

p1[i] = 10.0 + i;

}

main()

{

testmalloc<<<1,1>>>();

testm<<<1,1>>>();

}

In which method this can be achieved?

cmaster.matso · January 22, 2012, 9:49am

(…)

__global__ void testmalloc()

{

float *p1, *p2;

p1 = malloc(10*sizeof(float));

}

__global__ void testm()

{

int i;

for(i=0; i<10; i++)

p1[i] = 10.0 + i;

}

main()

{

testmalloc<<<1,1>>>();

testm<<<1,1>>>();

}

(…)

Does Your code even compile? Beyond that isn’t it better to allocate global memory from within the host code and then pass pointer to it to both kernels?

Regards,

MK

pasoleatis · January 22, 2012, 3:58pm

I was not aware you can allocate memory from kernels, if it is I wonder what happens since every thread will try to allocate the same array.

cmaster.matso · January 23, 2012, 7:43am

As far as I know using ‘malloc’ in kernel refers to dynamic allocation of local memory (or the one for a single thread only in other words). But, please, correct me if I’m wrong.

Regards,

MK

pasoleatis · January 23, 2012, 7:50am

And what would be the benefit of using malloc inside a kernel vs declaring like this float p[10];
Did someone ever use malloc inside kernel successful? In the tutorials I read the examples never had malloc inside a kernel. This is why it just seems out of places.

In the example given in the first post the p1 is allocated in one kernel and then used in another so the intent is to use it as a variable in the global memory.

cmaster.matso · January 23, 2012, 8:02am

pasoleatis: You are absolutely right. Calling ‘malloc’ for every thread can cause a great drop in performance of Your application. It is possible to use it (as like as the ‘new’ operator, with device compute capability >= 2.0), but not very efficient, I think.

Topic		Replies	Views
cudaMalloc from inside a kernel CUDA Programming and Performance	3	12657	September 2, 2009
malloc shared memory to 1.1 device and cudaDeviceMapHost CUDA Programming and Performance	9	29389	April 15, 2011
want to allocate memory inside kernel CUDA Programming and Performance	2	1445	July 13, 2009
Question Dynamic Memory Allocation in the kernel function CUDA Programming and Performance	2	3628	November 30, 2009
memory allocation question CUDA Programming and Performance	6	4148	April 29, 2011
malloc in a kernel CUDA Programming and Performance	2	1774	July 1, 2009
cudaMallocHost and pthreads issues with accessing memory from different threads CUDA Programming and Performance	3	3319	November 14, 2008
Simple Question about kernels and global memory CUDA Programming and Performance	4	3990	June 12, 2009
malloc/realloc on __device__ CUDA Programming and Performance	4	3941	February 10, 2010
Memory usage within GPU CUDA Programming and Performance	2	2351	July 13, 2009

malloc of one kernel in another kernel Memory allocated in one kernel can be accessed in another ker

Related topics