Curand: potential correlation among multiple parallel streams using Xorwow generator

Shihab_Khan · September 12, 2023, 7:48pm

In the doc it says:

If an experiment spans multiple kernel launches, it is recommended
that threads between kernel launches be given the same seed, and sequence
numbers be assigned in a monotonically increasing way

This is how I’m doing it in a simulation kernel:

template <typename RNG>
__global__ void initCurand(RNG* state, int seq_offset) {
    int idx = blockIdx.x * blockDim.x + threadIdx.x;
    if(idx >= NS)
        return ;
    curand_init(1984, idx + seq_offset, 0, &state[idx]);
}

In the main simulation step function:

int NS = 1000; // number of parallel streams i.e. random generators I need
cudaMalloc(&d_state, NS * sizeof(RNG));
......
for (int iter = 0; ; ++iter) {
        // Initialize curand states
        initCurand<<<blocksPerGrid, threadsPerBlock>>>(d_state, NS * iter);

When using xorwow, I believe I’m getting streams with correlation between them. It fails PractRand test only after 64MB of data. Philox and MRG32 generators seem okay.

Here’s a link to full running code: Sep 12 7:42 PM - Codeshare

I tested with nvcc 12.2, gcc 11.4.

Edit: the tests it fails are follows:

length= 64 megabytes (2^26 bytes), time= 8.3 seconds
  Test Name                         Raw       Processed     Evaluation
  [Low8/32]BRank(12):768(1)         R= +2628  p~=  2.9e-792   FAIL !!!!!!!   
  [Low1/32]BRank(12):384(1)         R= +4695  p~=  2e-1414    FAIL !!!!!!!!  
  ...and 140 test result(s) without anomalies

Topic		Replies	Views
making XORWOR parallel with curan_init concerns about statistical properties curan-library CUDA Programming and Performance	4	5500	August 25, 2011
CURAND (device) seems to give correlated outputs among threads how to avoid? CUDA Programming and Performance	4	9874	December 7, 2011
curand_init sequence number problem CUDA Programming and Performance	8	1642	December 28, 2017
XORWOW Generator Understanding in CURAND GPU-Accelerated Libraries curand	6	6189	May 6, 2023
Question about optimal cuRAND() use GPU-Accelerated Libraries	7	2768	April 27, 2015
Trying to understand CURand (curand_init) sequence input parameter CUDA Programming and Performance	5	5595	April 19, 2011
uncorrelated random numbers CUDA Programming and Performance	15	5556	June 29, 2011
Curand allways get the same numbers Compute Sanitizer cuda	1	846	January 19, 2022
CURAND question CUDA Programming and Performance	1	1444	December 1, 2010
Sequence number in curand_init() CUDA Programming and Performance	2	1276	September 18, 2013

Curand: potential correlation among multiple parallel streams using Xorwow generator

Related topics