I had developed a kernel function that sort an array. I transferred the array to global memory , with array size maximum of 810241024 and compiles and run correctly using (8*1024+9) blocks with 1024 thread per block
if i increased the size to 1610241024 and upper, the program compiles and run, but the array is not sorted (may the kernel not work).
can any one explain what happened and suggest the solution?
I’m using Geforce Gt 740 m GPU