I ran into a problem and cannot figure out what causes it.
I am trying to call multiple kernels.
cudaMalloc array1, array2, array3…;
This works fine (Case 1). But with the following (Case 2):
cudaMalloc array1, array2;
cudaMalloc array3, …;
kernel2<<<>>>; // also uses array1 and array2
In Case 2 the program crashes almost half of the time, but in Case 1 it never does.
Is there something I should do for Case 2?
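To make Case 2 concrete, here is a minimal sketch of the pattern with an error check after every call, so a failure is reported where it happens rather than as a later crash. It is not my real code: `CUDA_CHECK`, `run_case2`, the kernel bodies, the `float` element type, and the launch sizes are all placeholders, and I am assuming a `kernel1` launch sits between the two groups of allocations.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical helper: print the CUDA error with its location and bail out.
// (For brevity it does not free earlier allocations on failure.)
#define CUDA_CHECK(call)                                          \
    do {                                                          \
        cudaError_t err_ = (call);                                \
        if (err_ != cudaSuccess) {                                \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,    \
                    cudaGetErrorString(err_));                    \
            return 1;                                             \
        }                                                         \
    } while (0)

// Placeholder kernels; the real ones do different work.
__global__ void kernel1(const float *a, float *b, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) b[i] = a[i] + 1.0f;  // bounds check avoids out-of-range writes
}

__global__ void kernel2(const float *a, const float *b, float *c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

// Case 2: allocate array1/array2, run kernel1, then allocate array3, run kernel2.
int run_case2(int n) {
    float *array1 = nullptr, *array2 = nullptr, *array3 = nullptr;
    size_t bytes = static_cast<size_t>(n) * sizeof(float);
    int threads = 256;
    int blocks = (n + threads - 1) / threads;

    CUDA_CHECK(cudaMalloc(&array1, bytes));
    CUDA_CHECK(cudaMalloc(&array2, bytes));
    CUDA_CHECK(cudaMemset(array1, 0, bytes));

    kernel1<<<blocks, threads>>>(array1, array2, n);
    CUDA_CHECK(cudaGetLastError());       // catches launch errors
    CUDA_CHECK(cudaDeviceSynchronize());  // catches errors raised while kernel1 ran

    CUDA_CHECK(cudaMalloc(&array3, bytes));

    kernel2<<<blocks, threads>>>(array1, array2, array3, n);
    CUDA_CHECK(cudaGetLastError());
    CUDA_CHECK(cudaDeviceSynchronize());

    CUDA_CHECK(cudaFree(array1));
    CUDA_CHECK(cudaFree(array2));
    CUDA_CHECK(cudaFree(array3));
    return 0;
}

int main() {
    return run_case2(1 << 20);
}
```

The `cudaDeviceSynchronize()` after each launch is only there for debugging: kernel launches are asynchronous, so without it an error from the first kernel may only show up at a later call such as the second `cudaMalloc`.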
Thanks in advance…