If I compile my code with nvcc on a fedora 9 box that has a quadro fx570 and then transfer and run the executable on a fedora 8 machine that has a Tesla server attached can I expect the executable to run at full speed???
I guess I’m asking does nvcc compile for my quadro FX570 that has 2 MPs adn so will only use 2 MPs on the tesla card???
I see about 3 times increase in peformance when running on the tesla compaired to my quadro. I was expecting a little more. I haven’t multiGPU’d the code so I was expecting roughly a 8x (16MP/2MP) performance increase.
My kernel launches around 37K threads so I guess this would be enough to fill the tesla’s 128 scalar processors (on 1 GPU)???