I’m trying to get L2 Cache Hit Rate using Nsight Compute for a simple CUDA code and I’m using the following Section file:
Metrics {
Label: "l2_hit_rate"
Name: "lts__t_sector_hit_rate.pct"
}
Metrics {
Label: "l2_tex_read_hit_rate"
Name: "lts__t_sector_op_read_hit_rate.pct"
}
Metrics {
Label: "l2_tex_read_transactions"
Name: "lts__t_sectors_srcunit_tex_op_read.sum"
}
Metrics {
Label: "l2_tex_write_hit_rate"
Name: "lts__t_sector_op_write_hit_rate.pct"
}
Metrics {
Label: "l2_tex_write_transactions"
Name: "lts__t_sectors_srcunit_tex_op_write.sum"
}
However depending on which of these metric I include the output of Nsight is different. Here for each result, those metrics that are not included have been commented out:
l2_hit_rate % 171.46
or
l2_hit_rate % 97.93
l2_tex_read_hit_rate % 4.75
l2_tex_write_hit_rate % 100
l2_tex_read_transactions sector 0
l2_tex_write_transactions sector 4
or
l2_hit_rate % 179.27
l2_tex_read_hit_rate % 2,418.75
l2_tex_write_hit_rate % 100
I was wondering what is going on here? More importantly what does larger than 100% mean? I’ve seen this behavior before for Utilization as well.
I have tried Nsight Compute 2019.5 and 2019.1 on two separate machine both running Ubuntu 18.04:
GPU:Titan RTX
Driver Version: 430.50
CUDA Version: 10.1
and
GPU: Quadro RTX 8000
Driver Version: 440.64
CUDA Version: 10.2