Hi guys, I'm working on my thesis, a parallel artificial neural network implementation. Basically, I read a file describing the problem to solve as input, and everything works fine until I have to use a very large input file.
Then my code starts to fail with the error "execution failed : (6) the launch timed out and was terminated." The problem is that the line reporting the error is a cudaMemcpy, not a kernel launch!
cudaMalloc((void**) &m_dLayersSize, m_numOfLayers * sizeof(uint32_t));
optimizerCudaCheckError("initSolutionSet: cudaMalloc() execution failed\n", __FILE__, __LINE__);
cudaMemcpy(m_dLayersSize, m_hLayersSize, m_numOfLayers * sizeof(uint32_t), cudaMemcpyHostToDevice);
optimizerCudaCheckError("initSolutionSet: cudaMemcpy() execution failed\n", __FILE__, __LINE__); // this line gives the error
I checked the host variables and they're OK: m_numOfLayers is 4 and the values in the host vector are 2500, 300, 100, 27.
With other examples everything works fine, so I'm a bit confused. I suspect the reported error is misleading and that the real cause is something related to memory usage or the complexity of the thread launches.
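A minimal sketch of how the failing call could be localized. Kernel launches are asynchronous, so a timeout in an earlier kernel can be reported by a later call such as cudaMemcpy; synchronizing before each check makes the error show up where it really happens. The names checkCudaSync, dummyKernel and copyLayerSizes are hypothetical and are not taken from the original libraries, and the internals of optimizerCudaCheckError are not shown here, so this is only an assumption about what an equivalent check might look like.

```cuda
#include <stdio.h>
#include <stdint.h>
#include <cuda_runtime.h>

// Hypothetical helper (NOT the real optimizerCudaCheckError): waits for all
// queued GPU work to finish, then reports any pending error with its location.
static cudaError_t checkCudaSync(const char *msg, const char *file, int line)
{
    cudaError_t err = cudaDeviceSynchronize();  // surfaces errors from earlier asynchronous kernels
    if (err == cudaSuccess)
        err = cudaGetLastError();               // also catches launch-configuration errors
    if (err != cudaSuccess)
        fprintf(stderr, "%s(%d): %s : (%d) %s\n",
                file, line, msg, (int)err, cudaGetErrorString(err));
    return err;
}

// Placeholder kernel standing in for the real network kernels.
__global__ void dummyKernel(uint32_t *layers, uint32_t n)
{
    uint32_t i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        layers[i] += 0;  // touches the buffer without changing its contents
}

// Copies the layer sizes quoted above to the device and checks every step.
cudaError_t copyLayerSizes()
{
    const uint32_t numOfLayers = 4;
    uint32_t hLayersSize[numOfLayers] = {2500, 300, 100, 27};
    uint32_t *dLayersSize = NULL;

    cudaMalloc((void **)&dLayersSize, numOfLayers * sizeof(uint32_t));
    cudaError_t err = checkCudaSync("cudaMalloc() failed", __FILE__, __LINE__);
    if (err != cudaSuccess)
        return err;

    cudaMemcpy(dLayersSize, hLayersSize, numOfLayers * sizeof(uint32_t),
               cudaMemcpyHostToDevice);
    err = checkCudaSync("cudaMemcpy() failed", __FILE__, __LINE__);
    if (err != cudaSuccess) {
        cudaFree(dLayersSize);
        return err;
    }

    dummyKernel<<<1, 32>>>(dLayersSize, numOfLayers);
    err = checkCudaSync("dummyKernel failed", __FILE__, __LINE__);  // a kernel timeout would be reported here

    cudaFree(dLayersSize);
    return err;
}
```

Calling copyLayerSizes() from main and placing the same kind of synchronizing check after every kernel launch in the real program should show which kernel actually runs too long. The extra synchronization slows things down, so it is meant for debugging only.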
My card is a GeForce GTX 560 Ti (compute capability 2.1) and the CUDA SDK version is 4.0.
I can't post my code since it is spread over 3 libraries, something like 4000 lines.
Thanks to all, Enrico
Edit: if I run the program with gdm shut down, the program gets a little further, as I expected, but then it fills the screen with artifacts and stops. I'm so desperate.