I wrote a small program which basically has a kernel that adds one to each element in the matrix (matrix is in global memory)
i then transfer the global matrix to a host matrix and print the values
it works fine for small matrix’s but for some reason it doesnt work for big matrixs (30000x30000), remaining with the initial value given to the elements.
i tested it out and figured that the last size of matrix that works is 11056x11056, if you make the algorithm one bit bigger eg 11057x11057, the kernel doesnt add to the items in the matrix anymore.
any ideas why?
any pointers on using big matrixes and the problems they may cause?
(ps… and yes, i do have enough threads given to the kernel)