Hi
May I know what is the description of sass__inst_executed_global_loads? Is that a thread-based counter for executed global loads? I didn’t find that neither in the following commands nor in the manual page.
It is correct that sass__inst_executed_global_loads (and similar metrics used by e.g. the Memory Workload Analysis tables) are not listed with the --query-metrics feature. The reason is that they are generated by a different provider internally. Similarly, device__attribute_* metrics would not be listed their, either. Most metrics however will be part of the --query-metrics output.
As indicated by the name, the metric is based on SW-patches, i.e. the kernel code is instrumented at runtime to count a specific property, in this case the number of global load instructions. The counter is incremented for every individual executed instruction, i.e. per GPU thread. It is used for the L1/TEX Cache table’s “Global Load Instructions” cell.