It seems as though if a kernel takes too long, the clEnqueueReadBuffer function returns CL_INVALID_COMMAND_QUEUE.
You can recreate this bug by changing the code for VectorAdd.cl from this:
[codebox]// add the vector elements
c[iGID] = a[iGID] + b[iGID];[/codebox]
// add the vector elements
c[iGID] = a[iGID] + b[iGID];
If you do this, the VectorAdd example fails. You can do something similar for the other examples and make them fail.
Is there a time limit on clEnqueueReadBuffer before it will return -36? Is there a way to adjust this time limit?
Also, when I add that for loop, my screen goes black for a second and I get a windows popup saying my NVIDIA adaptor failed but then came back online. Is there a general rule that kernels should not take more than ms to process?
Evolutionary Complexity Research Group
University of Central Florida