Dynamic allocate memory inside kernel How to dynamic allocate memory inside kernel

Quoc_Vinh · June 10, 2009, 12:56am

Hi all.
I have a problem, and i am trying to resolve it.
I want to use stack for each thread in kernel, the stack size will be decides by each thread.
but i don’t know how to dynamic allocate local memory inside kernel.
if anybody know or has any suggestion, please help me.
thank you.

Tobi_W · June 10, 2009, 7:42am

It is simply not possible to allocate dynamically memory on the device inside a running kernel. The memory is allocated through the host.

And if I’m not wrong, cuda does not implement a stack as you have on the host/cpu side.

jph4599 · June 10, 2009, 12:51pm

This thread has some information about maintaining a local stack, complete with some example code:
[url=“http://forums.nvidia.com/index.php?showtopic=97118&view=findpost&p=547788”]http://forums.nvidia.com/index.php?showtop...st&p=547788[/url]

It looks like the MAX DEPTH of the stack has to be known ahead of time (since you can’t dynamically allocate memory).

Quoc_Vinh · June 12, 2009, 12:51am

Thanks @Tobi_W and jph4599. :)
I staticaly allocated local memory for each threads. and I use a variable for indexing element of array (same as stack). It works correctly.
But I must paied for waste local memory because i had no way to do.