GPU memory hierarchy

How are a GPU's memories categorized as on-chip or off-chip?
Is there a tutorial that covers this?