Hi, I have compiled the cdpSimpleQuicksort example from the CUDA 5 SDK. I get the following errors when I try to run it on a K20:
CUDA error at cdpSimpleQuicksort.cu:131 code=4(cudaErrorLaunchFailure) "cudaDeviceSynchronize()"
CUDA error at cdpSimpleQuicksort.cu:153 code=4(cudaErrorLaunchFailure) "cudaMemcpy( results_h, results_d, n*sizeof(unsigned), cudaMemcpyDeviceToHost )"
Any help would be appreciated!
-KD