I have a program that dilates an A x A matrix by a B x B matrix, and I'm trying to find the limitations of the program I wrote.
When I dilate a 4096 x 4096 matrix by a 6 x 6 matrix, the program runs fine.
But if I dilate a 4096 x 4096 matrix by an 8 x 8 matrix, then as soon as my program gets to the kernel code, my monitor turns black and comes back on a second later. The program fails, and Windows pops up a message saying ‘Display Driver Stopped Responding and Has Recovered’.
Does this mean my graphics card doesn't have enough memory to handle this operation? I don't think that's the case, though…
My program uses A x A threads running in parallel, one per output element. Each thread runs a nested for loop over the B x B matrix (B iterations at each level), so each thread does B x B steps of work.
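To make the per-thread work concrete, here is a minimal CPU sketch of that loop structure. It is not the actual kernel: the names `dilate_pixel` and `dilate` are hypothetical, and it assumes non-negative pixel values, a structuring element anchored at its centre (B/2, B/2), and that neighbours outside the matrix are skipped. On the GPU, the body of `dilate_pixel` would correspond to what one thread does.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Work of one hypothetical "thread": compute output element (row, col) of an
// A x A matrix dilated by a B x B structuring element (row-major storage,
// nonzero entry = active). Assumptions: anchor at (B/2, B/2), out-of-range
// neighbours are skipped, pixel values are non-negative.
int dilate_pixel(const std::vector<int>& img, int A,
                 const std::vector<int>& se, int B, int row, int col) {
    int best = 0;
    for (int i = 0; i < B; ++i) {          // outer loop: B iterations
        for (int j = 0; j < B; ++j) {      // inner loop: B iterations
            if (se[static_cast<std::size_t>(i) * B + j] == 0) continue;
            int r = row + i - B / 2;
            int c = col + j - B / 2;
            if (r < 0 || r >= A || c < 0 || c >= A) continue;
            int v = img[static_cast<std::size_t>(r) * A + c];
            if (v > best) best = v;        // dilation keeps the neighbourhood maximum
        }
    }
    return best;
}

// CPU stand-in for the kernel launch: one dilate_pixel call per output element.
std::vector<int> dilate(const std::vector<int>& img, int A,
                        const std::vector<int>& se, int B) {
    assert(img.size() == static_cast<std::size_t>(A) * A);
    assert(se.size() == static_cast<std::size_t>(B) * B);
    std::vector<int> out(img.size(), 0);
    for (int row = 0; row < A; ++row)
        for (int col = 0; col < A; ++col)
            out[static_cast<std::size_t>(row) * A + col] =
                dilate_pixel(img, A, se, B, row, col);
    return out;
}
```

With this structure the total number of inner-loop steps is A x A x B x B: about 604 million for 4096 x 4096 by 6 x 6, and about 1.07 billion for 4096 x 4096 by 8 x 8.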
I haven't tried optimizing my kernel yet. Would this problem go away after I optimize my code (shared memory, etc.)?
Thanks in advance!