cuBLAS INT8 GEMM is much slower than FP16 on T4

If you think you’re getting incorrect results, I suggest filing a bug; see “How to report a bug”.