So, I guess 512 isn’t the right factor on the A100 for either FP16 or INT8. Could you please let us know which factor to use on the A100 for the different precisions?
My estimate is 4096, and the user in the mentioned post apparently suggests the same.
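A rough way to sanity-check an estimate like this is to work backwards from the published A100 peak numbers. The sketch below assumes the factor means "operations per SM per cycle", which may not be how the Nsight Compute section files define it. It uses the A100 SXM datasheet figures (108 SMs, 1410 MHz boost clock, 312 TFLOPS dense FP16 tensor, 624 TOPS dense INT8 tensor). The function and constant names are made up for illustration.

```python
import math

# Published A100 (SXM) figures, dense (no sparsity). Taken from the datasheet,
# not from Nsight Compute.
A100_SM_COUNT = 108
A100_BOOST_CLOCK_HZ = 1.41e9
A100_PEAK_OPS_PER_SECOND = {
    "fp16_tensor": 312e12,
    "int8_tensor": 624e12,
}


def ops_per_sm_per_cycle(peak_ops_per_second,
                         sm_count=A100_SM_COUNT,
                         clock_hz=A100_BOOST_CLOCK_HZ):
    """Peak operations one SM would have to retire per cycle.

    ASSUMPTION: the roofline "factor" is interpreted as ops/SM/cycle.
    """
    return peak_ops_per_second / (sm_count * clock_hz)


def nearest_power_of_two(value):
    """Round to the closest power of two (in log space)."""
    return 2 ** round(math.log2(value))


if __name__ == "__main__":
    for precision, peak in A100_PEAK_OPS_PER_SECOND.items():
        raw = ops_per_sm_per_cycle(peak)
        print(f"{precision}: {raw:.1f} ops/SM/cycle, "
              f"nearest power of two = {nearest_power_of_two(raw)}")
```

Under these assumptions, INT8 comes out at about 4096 ops per SM per cycle and FP16 at about 2048. That is consistent with the 4096 estimate for INT8 only, and it says nothing about which metric the factor should be multiplied with.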
As of today, we don’t have a published roofline model for integer operations in Nsight Compute, and that includes which factors and metrics would be needed. We have had this request before and are considering it along with other features.