Suppose I call two CURAND library functions in a kernel, say curand() and curand_uniform(), then do I need to call curand_init() twice, once for curand() and once for curand_uniform() for each thread, or is it sufficient to call curand_init() just once per thread, irrespective of number of CURAND library function calls inside the thread?