Being a noob in programming on CUDA , I would like to know -
-
Can we allocate memory on device during runtime? (i.e use cudaMalloc in a kernel function)
<I somehow guess that the answer is no! External Image > -
How can I create say N trees parallely on CUDA.
Based on my requirements, I need to build 1 tree on 1 thread. So if i have N threads, implies that I should have N trees!
A code snippet might help a lot!
Thanks in advance.