I am a novice CUDA programmer trying to use GPU computing to simulate large electrical networks. I have run into the following problem; if anybody has seen something similar, please help.
I have to initialize a matrix of size NxN. Its elements do not change during a single run of the simulation.
I use this matrix iteratively. Depending on the duration of the simulation, the number of iterations can be very large. In each iteration I multiply the matrix by a few vectors of size N. The elements of these vectors are not fixed: they are updated in every iteration from other variables defined in the program. I am using the GPU to do this multiplication.
After the multiplication, I have to update each of those variables. This requires reading their previous values (commonly called history terms in the literature) and overwriting them with new values computed from the result of the matrix-vector multiplication. This produces a new set of vectors of length N, after which I do the matrix-vector multiplication again and repeat the process.
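To make the loop described above concrete, here is a minimal sketch of one way it could be structured. It is not my actual code: the kernel names, the diagonal test matrix, and especially the update rule in updateWithHistory are placeholders chosen only so the example runs, and it uses a single vector instead of several.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// y = A * x for a row-major n x n matrix; one thread per row.
__global__ void matVec(const float* A, const float* x, float* y, int n)
{
    int row = blockIdx.x * blockDim.x + threadIdx.x;
    if (row >= n) return;  // guard: the grid can hold more threads than rows
    float sum = 0.0f;
    for (int col = 0; col < n; ++col)
        sum += A[row * n + col] * x[col];
    y[row] = sum;
}

// Placeholder update (an assumption, not the real network equations):
// the next input is the average of the new result and the history term,
// and the history term is then overwritten with the new result.
__global__ void updateWithHistory(const float* y, float* hist, float* x, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    x[i] = 0.5f * (y[i] + hist[i]);
    hist[i] = y[i];
}

// Runs the iteration on the GPU and returns x[0] for inspection.
float simulate(int n, int iterations)
{
    std::vector<float> hA(n * n, 0.0f), hx(n, 1.0f);
    for (int i = 0; i < n; ++i) hA[i * n + i] = 0.5f;  // simple, stable test matrix

    float *dA, *dx, *dy, *dhist;
    cudaMalloc(&dA, n * n * sizeof(float));
    cudaMalloc(&dx, n * sizeof(float));
    cudaMalloc(&dy, n * sizeof(float));
    cudaMalloc(&dhist, n * sizeof(float));
    cudaMemcpy(dA, hA.data(), n * n * sizeof(float), cudaMemcpyHostToDevice);  // copied once
    cudaMemcpy(dx, hx.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemset(dhist, 0, n * sizeof(float));  // history terms start at zero

    const int threads = 128;
    const int blocks = (n + threads - 1) / threads;  // round up so every row is covered
    for (int it = 0; it < iterations; ++it) {
        matVec<<<blocks, threads>>>(dA, dx, dy, n);
        updateWithHistory<<<blocks, threads>>>(dy, dhist, dx, n);
    }

    cudaMemcpy(hx.data(), dx, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(dA); cudaFree(dx); cudaFree(dy); cudaFree(dhist);
    return hx[0];
}

int main()
{
    printf("x[0] after 2000 iterations with N=84: %g\n", simulate(84, 2000));
    return 0;
}
```

The matrix is copied to the device once and stays there, and the vectors never leave the GPU between iterations. Both kernels check the index against n because the number of launched threads is rounded up to a multiple of the block size.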
Now the problem: with N=83 my program works fine for any number of iterations. But when I change to N=84, it seems to work for only around 1500 iterations, after which it produces 'nan' instead of the expected results. Compiling and running the program does not produce any errors.
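One way to narrow this down would be to check every CUDA call for errors and to count non-finite values in the result vector, so the first bad iteration can be located. The sketch below shows that idea in isolation. The CUDA_CHECK macro and the countNonFinite helper are hypothetical names, and whether the real cause is an unreported launch failure, an out-of-bounds access, or a numerical instability is only a guess at this point.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cmath>
#include <vector>
#include <cuda_runtime.h>

// Abort with a message if a CUDA runtime call reports an error.
#define CUDA_CHECK(call)                                               \
    do {                                                               \
        cudaError_t err_ = (call);                                     \
        if (err_ != cudaSuccess) {                                     \
            fprintf(stderr, "CUDA error %s at %s:%d\n",                \
                    cudaGetErrorString(err_), __FILE__, __LINE__);     \
            exit(EXIT_FAILURE);                                        \
        }                                                              \
    } while (0)

// Each thread inspects one element and bumps a counter if it is NaN or Inf.
__global__ void countNonFiniteKernel(const float* v, int n, int* count)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n && !isfinite(v[i])) atomicAdd(count, 1);
}

// Host helper: copies a vector to the device and returns how many entries
// are not finite. In a real loop the vector would already be on the device.
int countNonFinite(const std::vector<float>& hv)
{
    const int n = static_cast<int>(hv.size());
    float* dv = nullptr;
    int* dcount = nullptr;
    int hcount = 0;

    CUDA_CHECK(cudaMalloc(&dv, n * sizeof(float)));
    CUDA_CHECK(cudaMalloc(&dcount, sizeof(int)));
    CUDA_CHECK(cudaMemcpy(dv, hv.data(), n * sizeof(float), cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemset(dcount, 0, sizeof(int)));

    const int threads = 128;
    countNonFiniteKernel<<<(n + threads - 1) / threads, threads>>>(dv, n, dcount);
    CUDA_CHECK(cudaGetLastError());        // catches launch configuration errors
    CUDA_CHECK(cudaDeviceSynchronize());   // catches errors raised while the kernel ran

    CUDA_CHECK(cudaMemcpy(&hcount, dcount, sizeof(int), cudaMemcpyDeviceToHost));
    CUDA_CHECK(cudaFree(dv));
    CUDA_CHECK(cudaFree(dcount));
    return hcount;
}

int main()
{
    std::vector<float> v(84, 1.0f);
    v[10] = NAN;  // inject one bad value to exercise the check
    printf("non-finite entries: %d\n", countNonFinite(v));
    return 0;
}
```

Kernel launches do not return an error code themselves, so a failed launch can go unnoticed unless cudaGetLastError and a synchronizing call are checked afterwards. Calling a check like this on the result vector in every iteration while debugging, and stopping at the first nonzero count, would show whether the 'nan' appears suddenly or grows out of steadily increasing values. Running the program under compute-sanitizer may also help to rule out out-of-bounds memory accesses.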
If anybody knows of a similar problem, please share your experience.
Thank you in advance.