IÂ´m running a parabolic PDE simulation code using CUDA in a GTX 285 and everything was fine until I got a GTX 470. I was expecting to have a significant improvement in performance, but that was not the case. I noticed that for the task of solving linear systems at every time step, both GPUs had almost the same performance. I use finite elements and the resulting matrix is sparse and I store it using the DIAGONAL format, since my mesh is structured. IÂ´m using single precision and I’m measuring the elapsed time (ET) to solve the linear systems and when I compare the ET for 470 and 285, they are very close…
My question is why the performance of both GPUs are so close?
I started to think that this is because in GTX 285 we have 30 MPs with 8 SP in each, whereas in GTX 470 we have only 14 with 32 SP in each MP…and somehow one thing “compesates” the other. Is that true?