calloc-like function inside a CUDA kernel

Hi,

Since calloc cannot be called inside a CUDA kernel to allocate and zero-initialize memory, is there a CUDA function that does this on the device? I know about cudaMemset, but it is called from the host side of the code. Or does the initialization have to be done manually?

Note: Specifically, the memory has to be allocated inside the CUDA kernel function, separately by each CUDA thread.
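
To make the question concrete, here is a minimal sketch of the manual approach I have in mind. It assumes that in-kernel malloc, free and memset can be used from device code the way they are on the host. The names device_calloc, per_thread_buffers and count_failures are made up for illustration and are not part of CUDA.

```cuda
#include <cstdlib>
#include <cstring>
#include <cuda_runtime.h>

// Hypothetical helper, not a CUDA API: calloc-like allocation from the device heap.
__device__ void* device_calloc(size_t count, size_t size)
{
    // Reject requests whose byte count would overflow size_t.
    if (size != 0 && count > static_cast<size_t>(-1) / size) return nullptr;
    size_t bytes = count * size;
    void* p = malloc(bytes);               // in-kernel malloc (device heap)
    if (p != nullptr) memset(p, 0, bytes); // manual zero-initialization
    return p;
}

// Each thread allocates its own zeroed buffer, checks it, and frees it.
__global__ void per_thread_buffers(int* ok, int n_threads, int elems)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid >= n_threads) return;

    int* buf = static_cast<int*>(device_calloc(elems, sizeof(int)));
    if (buf == nullptr) { ok[tid] = 0; return; } // device heap exhausted

    int all_zero = 1;
    for (int i = 0; i < elems; ++i)
        if (buf[i] != 0) all_zero = 0;
    ok[tid] = all_zero;

    free(buf); // memory from in-kernel malloc is released with in-kernel free
}

// Host-side driver: returns the number of threads whose buffer was missing
// or not zeroed, or -1 if a CUDA runtime call failed.
int count_failures(int elems)
{
    const int n_threads = 64;
    int h_ok[n_threads];
    int* d_ok = nullptr;

    if (cudaMalloc(&d_ok, n_threads * sizeof(int)) != cudaSuccess) return -1;
    per_thread_buffers<<<2, 32>>>(d_ok, n_threads, elems);
    if (cudaDeviceSynchronize() != cudaSuccess) { cudaFree(d_ok); return -1; }
    if (cudaMemcpy(h_ok, d_ok, sizeof(h_ok), cudaMemcpyDeviceToHost) != cudaSuccess) {
        cudaFree(d_ok);
        return -1;
    }
    cudaFree(d_ok);

    int failures = 0;
    for (int i = 0; i < n_threads; ++i)
        if (h_ok[i] != 1) ++failures;
    return failures;
}
```

Calling count_failures(16) from main should report how many threads did not get a zeroed buffer. For larger per-thread buffers I assume the device heap would have to be enlarged first with cudaDeviceSetLimit(cudaLimitMallocHeapSize, ...) before launching the kernel. Is something like this the recommended way, or is there a built-in equivalent?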

Thanks