I’m wondering, is there something essentially wrong with the following code:
result = 0;
for(int i = 0; i < 10000; i++)
for(int j = 0; j < 10000; j++)
result = i0.01 + j0.01;
Console error output: Unspecified launch failure
The code executes fine on the GPU if i put 1000 instead of 10000 at the loop conditions but crashes with the given values. This is basically my very first cuda program and i was mainly just trying to test random constructs. So don’t mind about the usefulness of the given code.
In case of a not so obvious problem it would be kind to point me to the given section that might explain the problem in the programming guide.
thanks in advance.