According to the deviceQuery output, the memory clock rate for the 2080Ti is 7000 MHz:
CUDA Driver Version / Runtime Version 10.0 / 10.0
CUDA Capability Major/Minor version number: 7.5
Total amount of global memory: 10989 MBytes (11523260416 bytes)
(68) Multiprocessors, ( 64) CUDA Cores/MP: 4352 CUDA Cores
GPU Max Clock rate: 1545 MHz (1.54 GHz)
Memory Clock rate: 7000 MHz
Memory Bus Width: 352-bit
However, on websites such as https://www.techpowerup.com/gpu-specs/geforce-gtx-1080-ti.c2877 the memory clock rate is listed as
Memory Clock
1376 MHz
11008 MHz effective
May I know how deviceQuery calculates that rate (or fetches from the device)?
(note that you are comparing two different GPUs here)
7000 MHz (for the 2080Ti) is the equivalent double-pumped (DDR) rate; double-pumped means two bit transfers happen per lane/wire, per clock.
11008 MHz (for the 1080Ti) is the equivalent single-pumped (SDR) rate. Converting that to a double-pumped rate involves dividing by 2: 11008 / 2 = 5504, approximately 5500 MHz.
This suggests that the 2080Ti memory bandwidth is about 7000/5500 times the memory bandwidth of the 1080Ti, since both use a 352-bit bus width.
2080Ti:
https://www.techpowerup.com/gpu-specs/geforce-rtx-2080-ti.c3305
1080Ti:
https://www.techpowerup.com/gpu-specs/geforce-gtx-1080-ti.c2877
616/484 = 1.27 (memory bandwidth ratio)
7000/5500 = 1.27 (clock ratio)
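A quick arithmetic check of those two ratios (a sketch in Python; the bandwidth and clock figures are taken from the techpowerup pages linked above):

```python
# Published memory bandwidths (GB/s), from the techpowerup pages above
bw_2080ti = 616
bw_1080ti = 484

# Double-pumped (DDR) equivalent memory clocks (MHz)
clk_2080ti = 7000
clk_1080ti = 11008 / 2  # single-pumped 11008 MHz -> double-pumped 5504 MHz

print(f"bandwidth ratio: {bw_2080ti / bw_1080ti:.2f}")  # ~1.27
print(f"clock ratio:     {clk_2080ti / clk_1080ti:.2f}")  # ~1.27
```

Both ratios come out to about 1.27, as expected.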
The 1376 number is just the 11008 number divided by 8. All modern clocking systems involve a base frequency that is multiplied up to give the actual clocking observed on the bus. The multiplier here (8) isn’t that important, and neither is the base frequency. The effective frequency is what is most conveniently used to compute available bandwidth. Make sure to scale correctly for DDR or SDR effective, and if comparing clocks, make sure to compare the same clocks (DDR to DDR or SDR to SDR).
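For example, the relationship between the base and effective numbers on the 1080Ti page:

```python
base_mhz = 1376     # base memory clock shown on the techpowerup page
multiplier = 8      # the multiplier itself isn't important
effective_mhz = base_mhz * multiplier
print(effective_mhz)  # 11008, the "effective" figure on the page
```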
Bandwidth calculation (2080Ti example):
7000 (effective DDR MHz) * 352 (bus width in bits) * 2 (bits per DDR clock) / 8 (bits per byte) = 616 GB/s (the published 2080Ti memory bandwidth)
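The same calculation as a sketch in Python, applied to both cards (using each card's effective rate with the matching transfers-per-clock factor):

```python
def bandwidth_gb_s(effective_mhz, bus_width_bits, transfers_per_clock):
    """Memory bandwidth in GB/s from an effective memory clock in MHz."""
    return effective_mhz * 1e6 * bus_width_bits * transfers_per_clock / 8 / 1e9

# 2080Ti: 7000 MHz is a double-pumped (DDR) rate -> 2 transfers per clock
print(bandwidth_gb_s(7000, 352, 2))   # 616.0 GB/s

# 1080Ti: 11008 MHz is a single-pumped (SDR) rate -> 1 transfer per clock
print(bandwidth_gb_s(11008, 352, 1))  # ~484.4 GB/s
```

Both results match the published bandwidth figures (616 GB/s and 484 GB/s).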