Serialization within a for loop?

macvacaent · June 17, 2019, 8:16pm

Hi! One thing I wasn’t able to test out recently was whether everything inside a for loop in a cuda kernel would be done sequentially? if not is any way to make operation inside a for loop serialized (unless of course with dynamic parallelization another kernel is called inside which would subsequently call a new sets of sub threads to continue the operation ))

Robert_Crovella · June 17, 2019, 8:41pm

With respect to a single thread, everything is sequential (not including CDP - cuda dynamic parallelism) just as you would expect for a C or C++ style thread of execution.

There is no implied order of operations in the CUDA programming model when considering operations in separate threads, other than what you as a programmer explicitly provide via synchronization functions, cuda cooperative groups, etc.

Topic		Replies	Views
Loops in kernels CUDA Programming and Performance	2	1317	September 3, 2009
Possible to paralellize dependent forloop in cuda? Each iteration has to occur in the order CUDA Programming and Performance	1	4000	June 16, 2008
Parallelizing for loops using CUDA CUDA Programming and Performance	3	2533	March 8, 2012
Question about nested for-loop, and how it works CUDA Programming and Performance	2	1472	September 28, 2020
thread local 'for loop' question thread parallel for loop execution CUDA Programming and Performance	5	3388	August 29, 2007
beginner cuda question sequential programming CUDA Programming and Performance	3	5082	September 9, 2008
Serialize inner loop (CUDA C) Legacy PGI Compilers	4	2511	November 29, 2011
Converting a for loop to cuda CUDA Programming and Performance	2	2103	June 14, 2012
CUDA functions How should CUDA functions be called? CUDA Programming and Performance	7	5559	August 13, 2009
Synchronizing threads CUDA Programming and Performance	1	5923	March 21, 2007

Serialization within a for loop?

Related topics