Question about Memory Workload Analysis.
does there exist any documentation about Memory Workload Analysis?
Especially I want to know the keywords in map. It seems very helpful but I cannot found documentation in Nsight Compute :: Nsight Compute Documentation
I am attached the samples on Nsight Compute with running tf32TensorCoreGemm in cuda-samples.
ncu_cudasamples_tf32.pdf (1.2 MB)