I am currently running a program using a kernel that does operations on a worst case scenario to evaluate the maximum frame rate per second that we can get with our graphic card, a GeForce 8600.
We will be using, in our final product, a Fx 3600 and I was wondering if there was a way to predict, approximately, what will be the performances using this graphic card.
This specific kernel can do, using the CUDA Profiler, approximately 124 instructions by usecond. Knowing how many instructions I need to run the kernel, I can predict the time that it will take to run the kernel in the GPU in useconds on my GeForce 8600 and it seems quite accurate yet.
Knowing that the Fx 3600 has 12 multiprocessors and that the GeForce 8600 has only 4 of them, can I say that the number of operations by useconds could be around 3 times those of the GeForce, so about 372 instructions/useconds or is it a too simplist way to look at the card performances using CUDA ? :unsure:
If I’m wrong, then what elements would give me more details on the performances of my kernel under a FX 3600 ?
Thank you in advance !