WARP , Shared Memory, synchronization Synchronisation within WARP threads

Sarnath · December 15, 2007, 2:19pm

HI,

The manual 1.0 says that threads inside a warp cannot assume strict ordering among themselves with respect to shared memory. One has to use __syncthreads for this. But __syncthreads makes it to synchronize with all other threads in the BLOCK which is a very costly operation.

When I experimented with the PTX assembly code that is generated I realized that the compiler optimizes the LOAD from shared memory with “registers”. So, I guessed that the use of “volatile” keyword would solve the problem. And, it did. Later, I also found the same reference in 1.1 manual too. So, THis kinda confirms what I have seen.

I just want a small confirmation from NVIDIA that apart frm the “volatile” keyword there is NO other restriction (like a hardware limitation) that prevents a strict ordering among threads in a warp.

i.e.

If I use volatile variables then THREAD I should be able to see the latest volatile Shared memory data that was generated by THREAD J – given I and J belong to same warp.

Thank you

Sarnath · December 18, 2007, 2:59am

Greatly Appreciate a reply for this question. Thanks a lot.
The algo that I am going to choose depends on this and this can matter a lot.
Thank you.

Topic		Replies	Views
are threads of a warp really sync? CUDA Programming and Performance	2	813	August 3, 2011
Volatile Keyword What exactly does it do? CUDA Programming and Performance	3	2393	February 17, 2009
questions about thread execution & volatile CUDA Programming and Performance	19	17028	December 29, 2008
if-else WARP divergence WARP divergence CUDA Programming and Performance	17	16954	January 5, 2008
Shared mem RAW without sync CUDA Programming and Performance	3	3419	January 5, 2009
warp synchronization test CUDA Programming and Performance	5	1721	September 2, 2014
is syncthreads needed when will divergent threads in same warp re-sync CUDA Programming and Performance	9	3320	January 23, 2012
Shared Memory and Read After Write CUDA Programming and Performance	2	1529	July 2, 2009
Race condition within warp CUDA Programming and Performance	9	3116	September 20, 2016
Is syncthreads required within a warp? CUDA Programming and Performance	10	12566	November 8, 2013

WARP , Shared Memory, synchronization Synchronisation within WARP threads

Related topics