Here is a simple program that reproduces the issue:
#include <stdio.h>
#include <cuda.h>
#include <curand_kernel.h>
#include <unistd.h>
// Kernel: each thread initializes its own cuRAND state (seed 0, sequence =
// global thread id, offset 0) and prints four uniform doubles in (0, 1].
// Intended launch: any 1-D grid; device printf is for demonstration only.
__global__ void
func(void){
    int id = blockIdx.x * blockDim.x + threadIdx.x;
    curandState s;
    // Per-thread sequence number (id) gives each thread an independent stream.
    curand_init(0, id, 0, &s);
    for (int i = 0; i < 4; i++) {
        // Note: "%d %f\n" was originally written with typographic quotes,
        // which do not compile; fixed to plain ASCII double quotes.
        printf("%d %f\n", id, curand_uniform_double(&s));
    }
}
// Minimal error-check helper: print the failing call's location and abort.
#define CUDA_CHECK(call)                                                      \
    do {                                                                      \
        cudaError_t err_ = (call);                                            \
        if (err_ != cudaSuccess) {                                            \
            fprintf(stderr, "CUDA error %s:%d: %s\n", __FILE__, __LINE__,     \
                    cudaGetErrorString(err_));                                \
            return 1;                                                         \
        }                                                                     \
    } while (0)

// Measures free device memory before and after launching the kernel, to show
// how much memory a single curand_init-using launch consumes.
int main(void){
    CUDA_CHECK(cudaSetDeviceFlags(cudaDeviceMapHost));
    int const n_thread = 1;
    int const n_block = 1;
    size_t start_mem, end_mem, total_mem;
    CUDA_CHECK(cudaMemGetInfo(&start_mem, &total_mem));
    func<<<n_block, n_thread>>>();
    CUDA_CHECK(cudaGetLastError());  // catch launch-configuration errors
    printf("Version = %d\n", CUDA_VERSION);
    // cudaThreadSynchronize() is deprecated (removed in CUDA 12);
    // cudaDeviceSynchronize() is the supported replacement and also
    // surfaces any asynchronous kernel-execution error.
    CUDA_CHECK(cudaDeviceSynchronize());
    CUDA_CHECK(cudaMemGetInfo(&end_mem, &total_mem));
    // %zu is the correct format specifier for size_t (was %ld).
    printf("used memory = %zu MB\n", (start_mem - end_mem) / 1048576);
    return 0;
}
A single call to curand_init is consuming 298 MB of GPU memory on a Titan X (Pascal) and 267 MB on a Titan X (Maxwell). This is a huge bottleneck in our application, because we run multiple instances of a process that each call curand_init. Why does curand_init allocate this much GPU memory?