What is double buffering?

Sonia · November 30, 2009, 6:49am

Hi,

Can anybody tell me what is double buffering in CUDA or in general and why do we use it?

Thanks

Sarnath · November 30, 2009, 7:23am

Say you want to convert values in a BUFFER to another…(say Farenheit to Centigrade) …each thread works on an element… and would convert the same… No problem…

However, say, if each thread has to compare itself to its left and right successors and update the maximum or average or in gernaral a function of all three — then, you have race condition…

Thats when you use double buffering… ALl threads “read” from one buffer and update the other buffer…and so on…

Usually doub buff happens in loop… like BufferA is read, BufferB is written, Syncthreads, BufferB is read, BufferA is written, Syncthreads,… and so on

Sonia · November 30, 2009, 7:42am

Thanks Sarnath for your quick reply!

Well, the following implements a double-buffered version of the sum scan

for d := 1 to log2n do

	forall k in parallel do

		  if k â‰¥ 2d then

			   x[out][k] := x[in][k âˆ’ 2d-1] + x[in][k]

		  else

			   x[out][k] := x[in][k]

	swap(in,out)

I am not able to find where is BUFFER ‘A’ and where is BUFFER ‘B’ in the above code.

The corresponding CUDA code is:

__global__ void scan(float *g_odata, float *g_idata, int n)

{

	extern __shared__ float temp[]; // allocated on invocation

	int thid = threadIdx.x;

	int pout = 0, pin = 1;

	// load input into shared memory.

	// This is exclusive scan, so shift right by one and set first elt to 0

	temp[pout*n + thid] = (thid > 0) ? g_idata[thid-1] : 0;

	__syncthreads();

	for (int offset = 1; offset < n; offset *= 2)

	{

		pout = 1 - pout; // swap double buffer indices

		pin = 1 - pout;

		if (thid >= offset)

			 temp[pout*n+thid] += temp[pin*n+thid - offset];

		else

			 temp[pout*n+thid] = temp[pin*n+thid];

		__syncthreads();

	}

	g_odata[thid] = temp[pout*n+thid1]; // write output

}

Sarnath · November 30, 2009, 7:52am

“in” and “out” are the 2 buffers… Note that the last statement in the first code box, comes under the OUTER Loop… Thats where the buffers are exchanged… Isnt it?

Although in and out are just indices, the way they are using in the code says that they operate at different dimensions of a multi-Dimensional array… Jus thnk about it

Sonia · November 30, 2009, 8:25am

“in” and “out” are always the first dimension x, so I simply did not get how they are operating at different dimensions :(

Sarnath · November 30, 2009, 8:56am

Even if it in the same dimension, they are different buffers, isn’t it?

The code reads one buffer and updates another buffer… If there was no another buffer, there would be lot of race… isnt it?

Topic		Replies	Views
Someone can help me with the Scan application? CUDA Programming and Performance	0	1913	August 25, 2008
CUDA double buffer (producer/consumer) CUDA Programming and Performance	8	4035	November 20, 2017
How do I understand data prefetching with a double buffer? CUDA Programming and Performance cuda	1	671	March 21, 2024
How to use double Buffer CUDA Programming and Performance	3	6568	October 8, 2008
SSBO double buffering with compute shaders OpenGL	0	1356	June 7, 2017
No results are written to output buffer - why? CUDA Programming and Performance	34	19995	July 15, 2009
What is the best approach to write double-buffer pipeline? CUDA Programming and Performance cuda	0	655	October 31, 2023
Compare approach Two buffer comparition CUDA Programming and Performance	4	969	August 17, 2011
double buffer how to do it efficiently? CUDA Programming and Performance	6	2894	June 15, 2011
Race Conditions CUDA Programming and Performance	11	6317	June 1, 2010

What is double buffering?

Related topics