RadixPrefixSum kernel is hanging

It worked on the first run, but on the second and every subsequent run the RadixPrefixSum kernel from the Radix code hangs. Is the SDK code OK for sorting 5.3 million elements, or do I need to change any of the numbers in it?