Parallel Prefix Sum (Scan) with CUDA Latest version of document?

Hi Folks,

There is an NVIDIA whitepaper by Mark Harris called Parallel “Prefix Sum (Scan) with CUDA”, which describes the implementation of the scan algorithm in CUDA. It was also published in GPU Gems by NVIDIA, at http://http.developer.nvidia.com/GPUGems3/gpugems3_ch39.html.
This doc is at CUDA Toolkit Documentation, and several other servers across the Internet. I found a newer version, dated March 25, 2009, that “fixed a number of typos”, but it still contains at least one error I found. I’m wondering if there is a more up-to-date version of the document.

Does anyone know if there where I can find the latest version of this whitepaper?

Ken

Hi Folks,

There is an NVIDIA whitepaper by Mark Harris called Parallel “Prefix Sum (Scan) with CUDA”, which describes the implementation of the scan algorithm in CUDA. It was also published in GPU Gems by NVIDIA, at http://http.developer.nvidia.com/GPUGems3/gpugems3_ch39.html.
This doc is at CUDA Toolkit Documentation, and several other servers across the Internet. I found a newer version, dated March 25, 2009, that “fixed a number of typos”, but it still contains at least one error I found. I’m wondering if there is a more up-to-date version of the document.

Does anyone know if there where I can find the latest version of this whitepaper?

Ken

Did you ever find a more recent version?