global memory access synchronous or asynchronous read/write?

Mark_Harris · May 15, 2008, 6:07pm

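The architectural term for this is “scoreboarding”: the hardware tracks which registers are still waiting on outstanding operations, so a warp keeps issuing instructions after a load and stalls only when an instruction needs a result that has not arrived yet. The processors on our GPUs are fully scoreboarded. Also, the compiler tries to schedule available non-dependent instructions after high-latency instructions (such as global loads) in order to better hide latency. Note that loads can be used to hide the latency of other loads. For example, if you have 4 loads per thread, it is better to do this: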

    load( a )
    load( b )
    load( c )
    load( d )
    math( a )
    math( b )
    math( c )
    math( d )

than this:

    load( a )
    math( a )
    load( b )
    math( b )
    load( c )
    math( c )
    load( d )
    math( d )

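Let’s say each instruction takes 4 cycles per warp to issue, the load latency is 400 cycles, and you have 25 warps, so issuing one instruction for every warp takes 25 × 4 = 100 cycles. In the first case, the four loads take 400 cycles to issue, by which time the data for load(a) has arrived, and the four math instructions take another 400 cycles: math(d) finishes after 800 cycles, with zero stalls. In the second case, each load/math pair costs 100 cycles to issue the load, 300 cycles stalled waiting for it, and 100 cycles for the math, so math(d) finishes after 4 × 500 = 2000 cycles, with 60% of those cycles (1200) spent stalled waiting on loads.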
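The arithmetic above can be checked with a small toy model. This is only a sketch under stated assumptions, not a description of how the real warp scheduler works: every warp issues instruction k before any warp issues instruction k+1 (strict round-robin), load latency is counted from the cycle the load starts issuing, and math has no latency beyond its issue time. The names `simulate`, `batched`, and `interleaved` are made up for this illustration.

```python
def simulate(program, warps=25, issue_cycles=4, load_latency=400):
    """Return (total_cycles, stalled_cycles) for a toy in-order, round-robin model.

    Assumption: all warps issue one instruction before any warp moves on
    to the next, and a load's data is ready load_latency cycles after the
    load starts issuing.
    """
    clock = 0
    stalled = 0
    ready = {}  # (warp, register) -> cycle at which the loaded value is available
    for op, reg in program:
        for warp in range(warps):
            if op == "math":
                # Scoreboard check: wait until the operand's load has returned.
                wait = max(0, ready[(warp, reg)] - clock)
                clock += wait
                stalled += wait
            else:
                # Load: record when the data will arrive; issuing does not block.
                ready[(warp, reg)] = clock + load_latency
            clock += issue_cycles
    return clock, stalled


regs = ["a", "b", "c", "d"]

# First case: all loads first, then all math.
batched = [("load", r) for r in regs] + [("math", r) for r in regs]

# Second case: each load immediately followed by the math that needs it.
interleaved = [step for r in regs for step in (("load", r), ("math", r))]

for name, program in (("batched", batched), ("interleaved", interleaved)):
    total, stalled = simulate(program)
    print(f"{name}: {total} cycles, {stalled} stalled ({stalled / total:.0%})")
# batched: 800 cycles, 0 stalled (0%)
# interleaved: 2000 cycles, 1200 stalled (60%)
```

Changing `warps` or `load_latency` in the call shows how the picture shifts: with fewer warps, even the batched ordering starts to stall in this model, because the loads finish issuing before the first one has returned.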

The compiler tries to do the first case. :)

Mark
