Hi everybody,
I ran into a strange issue, I developed a cuda app on a gtx 260, I bought a new gtx 465 a few days ago and the performance was the same. I checked the samples from sdk and things like scan eigenvalues trigger even wouse results.
gtx 260:
SCAN
scan-Large, Throughput = 205.5178 MElements/s, Time = 0.00128 s, Size = 262144 E
lements, NumDevsUsed = 1, Workgroup = 256
eigenvalues
Iterations to be timed: 100
Result filename: ‘eigenvalues.dat’
Gerschgorin interval: -2.894310 / 2.923303
Average time step 1: 15.531688 ms
Average time step 2, one intervals: 4.953684 ms
Average time step 2, mult intervals: 0.017448 ms
Average time TOTAL: 20.831189 ms
gtx 465
SCAN
scan-Large, Throughput = 203.7271 MElements/s, Time = 0.00129 s, Size = 262144 E
lements, NumDevsUsed = 1, Workgroup = 256
Result filename: ‘eigenvalues.dat’
Gerschgorin interval: -2.894310 / 2.923303
Average time step 1: 37.504597 ms
Average time step 2, one intervals: 12.808976 ms
Average time step 2, mult intervals: 0.019804 ms
Average time TOTAL: 50.474590 ms
I use the latest version of the drivers, the only thing difference is the OS, the gtx 465 is running on Windows Server 2003 64bit.
Can some help me in this matter? Maybe the fact the 465 has less SM?
In the device query from SDK has only 11 SMs compared with the gtx 260 with 27 SMs.
Best regards,