If you think you’re getting incorrect results, I suggest filing a bug. How to report a bug
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
The best input layout settings in CuBlas | 4 | 235 | August 27, 2024 | |
cuBLAS INT8 tensor core mode vs. FP16 mode | 13 | 5496 | December 5, 2022 | |
New cuBLAS 12.0 Features and Matrix Multiplication Performance on NVIDIA Hopper GPUs | 0 | 528 | February 1, 2023 | |
cublasGemmEx doesn't work with INT8 utilizing __dp4a instruction on NVIDIA 1080TI | 12 | 3656 | September 25, 2017 | |
cuBLAS INT8 tensor core mode vs. FP16 mode | 0 | 886 | February 15, 2019 | |
why the Tesla T4 peak performance test result mismatch with the official doc | 8 | 2488 | October 19, 2019 | |
Blackwell Integer | 139 | 2861 | June 26, 2025 | |
Is it correct that my Pascal card is calling Maxwell_gemm kernels through cublas? And if so, why is cublas unusably slow for me? | 6 | 949 | August 23, 2018 | |
Discrepancy Between Claimed and Actual Sparse INT8 Performance of Tensor Cores on Jetson AGX Orin | 15 | 400 | September 11, 2024 | |
cuBLAS convolution does not use Tensor Cores | 6 | 2237 | June 8, 2021 |