I’m trying to profile a few benchmarks from the Rodinia suite with computeprof on a GeForce GTX 680 (CUDA 5.0). The problem I’m running into is that all of the L1 cache statistics always show up as 0. I’ve also tried the BlackScholes application from the CUDA 5.0 SDK, and its L1 cache statistics are also 0 for everything except the local load/store misses.
My question is: is there a specific flag or switch I need to set to get the L1 cache statistics to appear? Or is it just a property of the benchmarks I’ve selected (bfs, backprop, BlackScholes) that they happen to generate no L1 traffic?