architechture detail of implementation of local memory

Hi folks,
I would like to know about the architectural implementation of the local memory? Its slower than the shared memory according to the programming model, meaning its definitely off chip, but where?
Meenakshi Sundaram

local memory is “local” for thread, say it is private under programming view.

local memory locates in global memory (off-chip DRAM) physically.