From Programming guide:
"Kepler serviced loads at a granularity of 128B when L1 caching of global loads was enabled and 32B otherwise. On Pascal the data access unit is 32B regardless of whether global loads are cached in L1. So it is no longer necessary to turn off L1 caching in order to reduce wasted global memory transactions associated with uncoalesced accesses. "
Maxwell had 128byte L1 cacheline and 32Byte L2 line right ?
Does this mean in Pascal L1 we get only latency advantage for data access. Bandwidth remains the same as L2.