I would like to know if the dram transaction metrics, e.g. dram__sectors_read.sum
are at device or sm level?
For example, I see
dram__sectors_read.sum sector 359020.000000 362896.000000 359714.383838
smsp__thread_inst_executed.sum inst 3318416002.000000 3517973660.000000 3431244663.151515
Since thread_inst_executed is at smsp level, if the dram metric is at device level, I have to calculate NUMBER_OF_SMS * 4 * smsp_thread_inst_executed to see how many instructions were executed on the device with respect to the number of dram read transactions. Is this argument correct?