There is an NVIDIA whitepaper by Mark Harris called “Parallel Prefix Sum (Scan) with CUDA”, which describes an implementation of the scan algorithm in CUDA. A version was also published by NVIDIA as Chapter 39 of GPU Gems 3, at http://http.developer.nvidia.com/GPUGems3/gpugems3_ch39.html.
The whitepaper is available at http://developer.download.nvidia.com/compute/cuda/1_1/Website/projects/scan/doc/scan.pdf and on several other servers across the Internet. I found a newer version, dated March 25, 2009, that “fixed a number of typos”, but it still contains at least one error. I’m wondering whether there is a more up-to-date version of the document.
Does anyone know where I can find the latest version of this whitepaper?