cublasSgemm - is there a way to choose algorithm
|
|
6
|
67
|
August 15, 2022
|
Run Parallel Tensor Cores GEMM and Cuda GEMM
|
|
9
|
706
|
August 14, 2022
|
Matrix multiplication and transpose in row-major matrices
|
|
1
|
127
|
July 13, 2022
|
cublasSdot_v2() gives different results when running on different GPU types,
|
|
3
|
169
|
July 5, 2022
|
Cublas Bug
|
|
8
|
284
|
June 21, 2022
|
What does "sliced1x4_nn" mean in matmul?
|
|
0
|
186
|
June 17, 2022
|
Calling cuSolverSp from CUDA FORTRAN
|
|
3
|
243
|
June 13, 2022
|
cusparseScsrilu02 breaks with large matrices
|
|
10
|
325
|
June 1, 2022
|
Undefined reference to `cublasCreate_v2'
|
|
14
|
23909
|
May 11, 2022
|
[Urgent] Can I use cuBLAS functions in multicore CPU parallelism with OpenACC?
|
|
2
|
197
|
May 4, 2022
|
Error Internal: Blas GEMM launch failed
|
|
3
|
718
|
April 30, 2022
|
How to serialize cublasLtMatmulAlgo_t
|
|
2
|
215
|
April 20, 2022
|
Fixed Point cublasCgemm?
|
|
1
|
197
|
April 13, 2022
|
Static linking cublas
|
|
4
|
388
|
April 5, 2022
|
Having multiple relatively small problems
|
|
5
|
315
|
April 7, 2022
|
When cudaFree() will be called
|
|
3
|
360
|
March 20, 2022
|
NVIDIA_TF32_OVERRIDE=0 not disabling TF32 in cublas
|
|
7
|
611
|
January 13, 2022
|
cuSOLVER/cuBLAS solving of LUx=b with batched interface
|
|
2
|
319
|
January 7, 2022
|
Incorrect call to smbv
|
|
4
|
286
|
December 31, 2021
|
Factorization for batched matrices
|
|
2
|
287
|
December 30, 2021
|
Fp32 & a100
|
|
3
|
253
|
December 16, 2021
|
cuBLAS causes 7% performance drop in subsequent kernels
|
|
0
|
252
|
October 28, 2021
|
Tensorrt take much more cpu ram in RTX3070
|
|
7
|
736
|
October 15, 2021
|
High Memory Usage in cuSOLVER unpivoted QR (geqrf)
|
|
0
|
222
|
October 8, 2021
|
Adapt FP32 operation with TF32
|
|
4
|
242
|
October 7, 2021
|
Mixed Precision Algorithm in Ampere Is Slower Than Volta
|
|
1
|
220
|
October 6, 2021
|
cublasGemmEx is a Tensor Core operation or CUDA core?
|
|
3
|
253
|
October 3, 2021
|
Parallel execution of GEMM with other Operations
|
|
3
|
242
|
September 18, 2021
|
Disable Tensor Cores in cuBLAS functions explicity
|
|
4
|
675
|
January 28, 2022
|
Covariance of 3 dimensional sparse boolean array in Cublas/cuSPARSE
|
|
0
|
336
|
September 10, 2021
|