Poor accuracy of curand Sobol at high dimensions

matthew.arnold-1 · May 25, 2021, 1:38am

I have been using the curand Sobol device api “sphere” example as a starting point for my own code, but I have noticed a couple of potential problems with it. One is that it is unexpectedly slow - I found it could be sped up a few times by caching the generator states in the kernel (at lower dimensionality), presumably reducing the bandwidth required for transferring the (quite large) states.

More concerning is poor accuracy, which seems to be related to its use of separate dimensionality for each thread. Probably this is not a good strategy for optimal performance anyway, but it is useful for validation of the generator at high dimensions. I found that reducing dimensionality significantly improved accuracy. This is possibly related to the initialization vectors - I note curand only cites Joe & Kuo’s first paper, not their second one that addresses some problems at high dimension (see https://web.maths.unsw.edu.au/~fkuo/sobol/). It is also worth noting that even the corrected vector set only satisfies “Property A” up to dimension 1111.

Are the developers aware of the problems with the original Joe & Kuo set, and is the first or second set included in curand?

mnicely · May 28, 2021, 1:14pm

cuRAND uses new-joe-kuo-6.21201 set. Sobol example was not focused on getting the best performance.

To get the quality you’re looking for, you’ll need to use the Host API

matthew.arnold-1 · May 28, 2021, 9:19pm

Thanks for the info. Is there any reason to expect a difference in output from the host api? In my testing (at low dimensions) the answer seemed to be the same. I was also able to get a bit more speed from the device api, but perhaps my implementation with the host api wasn’t quite optimal.

Topic		Replies	Views
Confused about CURAND Sobol generator. CUDA Programming and Performance	7	5288	May 23, 2013
Error in Sobol direction vectors in curand curandGetDirectionVectors32 failing at runtime CUDA Programming and Performance	0	3955	September 24, 2011
how to use curand with CURAND_RNG_QUASI_SOBOL32 option? GPU-Accelerated Libraries	5	761	November 8, 2019
CURAND CURAND low per CUDA Programming and Performance	8	3130	April 12, 2011
why i need setup_kernel for curand states? GPU-Accelerated Libraries	19	2746	June 14, 2019
Calculating variance of Quasi-Monte Carlo with scrambled Sobol GPU-Accelerated Libraries cuda	0	868	May 12, 2022
curand host performance GPU-Accelerated Libraries	6	1461	December 29, 2016
how to get same output by CURAND in CPU and GPU CUDA Programming and Performance	3	5951	July 19, 2011
CURAND question CUDA Programming and Performance	1	1450	December 1, 2010
Curand allways get the same numbers Compute Sanitizer cuda	1	849	January 19, 2022

Poor accuracy of curand Sobol at high dimensions

Related topics