I am trying to run a roofline analysis on a small test program. I found that "Device Memory Read Throughput" varies considerably from run to run; in an extreme case, it is reported as 0 B/s. I understand that the value may be affected by whether memory is in a cold state, along with many other runtime effects, if I am right. Is there an option I can turn on to get an average (mean) value over several test runs?
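
To make the request concrete, this is roughly the aggregation I would otherwise do by hand after collecting the metric from several runs. It is only a sketch: `summarize_throughput` is a hypothetical helper name, and the sample values are placeholders, not real measurements.

```python
from statistics import mean, median


def summarize_throughput(samples_bytes_per_s):
    """Summarize per-run throughput samples given in bytes per second."""
    if not samples_bytes_per_s:
        raise ValueError("need at least one sample")
    return {
        "mean": mean(samples_bytes_per_s),
        # The median is less sensitive to an outlier run such as 0 B/s.
        "median": median(samples_bytes_per_s),
        "min": min(samples_bytes_per_s),
        "max": max(samples_bytes_per_s),
    }


if __name__ == "__main__":
    # Placeholder values standing in for the metric read from each run.
    runs = [0.0, 1.2e9, 1.5e9, 1.4e9]
    print(summarize_throughput(runs))
```

I would prefer the tool to do this itself if such an option exists.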