My understanding is we have two options:
kernel<<<numBlocks, numThreadsPerBlock>>>( arguments, ...);
Or:
kernel<<<numBlocks, numThreadsPerBlock, ???, stream>>>( arguments, ...);
I have been searching for hours but can’t find anything useful on what the ??? is except that most people seem to set it to zero.
Any help? In basic terms? Where is this documented?