Shader clock, core clock, memory clock

Hello everyone,

First of all I would like to say that I just started reading about GPUs, so I am far from an expert! :)

I keep reading about GPUs that are clocked with 3 different clock domains: shader, core, and memory clock. OK, the memory clock governs data transfers between the GPU and the off-chip memory, right? But what about the other two clock domains? I know that a GPU consists of a few multiprocessors, each one containing a few processors. So, if I understand correctly, the shader clock is the one that goes to all multiprocessors, and the core clock is the one that goes to each processor inside the multiprocessors?

Can anyone elaborate on this topic?

Kind regards,

Actually, you have the core and shader clocks backwards. The core clock runs some functions at the multiprocessor level, like the instruction decoder, and the shader clock runs the individual processors. The shader clock is the faster of the two, and it sets the speed of arithmetic operations on the processors.

When estimating the arithmetic speed of a GPU, the core clock is not the important number; the shader clock is.

Thank you very much for your reply, Seibert! This was very helpful! :thumbup:

What I don’t understand is why the shader clock is 2.5x the core clock. I thought all the stuff with half-warps meant the ALUs ran at exactly twice the clock of the dispatcher/SRAM/etc.

I am also a novice at CUDA and GPUs. I'm a little confused about the memory clock. The new NVIDIA GPU cards use PCIe 2.0, which doubles the data transfer speed. Does that speed refer to transfers between the host and the device (GPU), while the memory bandwidth refers to the speed between global memory (device memory) and the shared memory (on-chip memory)?

And why is the shader clock twice the core clock for certain GPUs, and more than twice for others? :whistling:

The PCIe version only affects host-to-device and device-to-host bandwidth. Nothing you do inside a kernel is affected by PCIe bandwidth at all; kernels read and write device memory at the card's memory bandwidth, which is set by the memory clock and bus width.