Does the CUDA runtime properly “do nothing” if a kernel is invoked with zero threads, i.e. blocksize == 0?
I’d like to avoid a test for zero, if CUDA is going to do that anyway.
Does the CUDA runtime properly “do nothing” if a kernel is invoked with zero threads, i.e. blocksize == 0?
I’d like to avoid a test for zero, if CUDA is going to do that anyway.
Passing a block or grid size containing a zero for any dimension will generate an execution parameter error IIRC.