I'd like to calculate the maximum memory speed my card can achieve.
Assume I have a G80 with a 384-bit memory interface and a memory clock of 900 MHz.
To get GB/s, I'd simply convert the units: a factor of 8 for bits to bytes, 1024 for bytes to kilobytes, and so on, but the number seems ridiculously small.
In the CUDA parallel reduction whitepaper, they made a sample calculation for the same graphics card and came up with 86.4 GB/s, calculated with
384 * 1800/8
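Written out with explicit units, that arithmetic can be sketched as below. This is only a sketch under stated assumptions: that 1800 is an effective data rate in MHz (2 transfers per 900 MHz clock cycle, as with double-data-rate memory), and that GB means 10^9 bytes rather than 2^30 bytes. The function name `peak_bandwidth_gb_s` and the `transfers_per_clock` parameter are made up for illustration.

```python
def peak_bandwidth_gb_s(bus_width_bits, memory_clock_mhz, transfers_per_clock=2):
    """Theoretical peak memory bandwidth in decimal GB/s (1 GB = 10**9 bytes).

    transfers_per_clock=2 is an assumption (double data rate memory).
    """
    bytes_per_transfer = bus_width_bits / 8                 # bits -> bytes
    transfers_per_second = memory_clock_mhz * 1e6 * transfers_per_clock  # MHz -> Hz
    bytes_per_second = bytes_per_transfer * transfers_per_second
    return bytes_per_second / 1e9                           # bytes/s -> GB/s


if __name__ == "__main__":
    # 384-bit interface, 900 MHz memory clock, 2 transfers per clock
    print(f"{peak_bandwidth_gb_s(384, 900):.1f} GB/s")      # prints "86.4 GB/s"
```

Read this way, 384 * 1800 / 8 gives 86400 in MB/s (because the clock is in MHz), and dividing by 1000 gives the 86.4 GB/s figure. Dividing by 1024 at each step instead yields a smaller number.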
Could someone explain to me how this makes any sense? The unit of this is byte/s. And why multiply the memory clock by 2?
Thanks and best regards, tdhd