Batch Matrix Multiplication using CuBLAS

darshancganji12 · February 18, 2021, 8:02pm

Hi Nvidia Team,

I am working on registering a plugin for an operator (Einsum) that is not currently supported in TensorRT. Instead of implementing a CUDA kernel, I want to use the cuBLAS library for batched matrix multiplication.

The equations I want to implement are (from the Einsum operator):
"ntg, ncg → nct" and "nct, ncp → ntp" (for batched matrix multiplication)
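
One way to read the two equations above as strided batched GEMMs is sketched below. This is only a CPU stand-in, not a cuBLAS call: `gemm_strided_batched_ref` is a hypothetical helper that mimics the documented column-major argument semantics of `cublasSgemmStridedBatched` with alpha = 1 and beta = 0, so that the `transa`/`transb`, `m`/`n`/`k`, leading-dimension and stride choices can be checked on the host. It assumes contiguous row-major float32 tensors; the names `einsum_ntg_ncg_nct`, `einsum_nct_ncp_ntp`, `make_data` and `max_abs_error` are also made up for illustration.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical CPU stand-in for cublasSgemmStridedBatched (alpha = 1, beta = 0).
// Column-major like cuBLAS: element (i, j) of a matrix with leading dimension
// ld lives at offset i + j * ld. Computes C = op(A) * op(B) for each batch.
void gemm_strided_batched_ref(bool transa, bool transb, int m, int n, int k,
                              const float* A, int lda, long long strideA,
                              const float* B, int ldb, long long strideB,
                              float* C, int ldc, long long strideC,
                              int batchCount) {
    for (int b = 0; b < batchCount; ++b) {
        const float* a = A + b * strideA;
        const float* bm = B + b * strideB;
        float* c = C + b * strideC;
        for (int j = 0; j < n; ++j)
            for (int i = 0; i < m; ++i) {
                float acc = 0.0f;
                for (int l = 0; l < k; ++l) {
                    float av = transa ? a[l + i * lda] : a[i + l * lda];
                    float bv = transb ? bm[j + l * ldb] : bm[l + j * ldb];
                    acc += av * bv;
                }
                c[i + j * ldc] = acc;
            }
    }
}

// "ntg,ncg->nct": Z[n,c,t] = sum_g X[n,t,g] * Y[n,c,g].
// A row-major (T x G) block is a column-major (G x T) matrix, so per batch
// Zcm (T x C) = Xcm^T * Ycm with m = T, n = C, k = G.
std::vector<float> einsum_ntg_ncg_nct(const std::vector<float>& X,
                                      const std::vector<float>& Y,
                                      int N, int T, int C, int G) {
    std::vector<float> Z(static_cast<std::size_t>(N) * C * T);
    gemm_strided_batched_ref(true, false, T, C, G,
                             X.data(), G, 1LL * T * G,
                             Y.data(), G, 1LL * C * G,
                             Z.data(), T, 1LL * C * T, N);
    return Z;
}

// "nct,ncp->ntp": Z[n,t,p] = sum_c X[n,c,t] * Y[n,c,p].
// Per batch Zcm (P x T) = Ycm * Xcm^T with m = P, n = T, k = C,
// so Y is passed as A and X as B.
std::vector<float> einsum_nct_ncp_ntp(const std::vector<float>& X,
                                      const std::vector<float>& Y,
                                      int N, int C, int T, int P) {
    std::vector<float> Z(static_cast<std::size_t>(N) * T * P);
    gemm_strided_batched_ref(false, true, P, T, C,
                             Y.data(), P, 1LL * C * P,
                             X.data(), T, 1LL * C * T,
                             Z.data(), P, 1LL * T * P, N);
    return Z;
}

// Small integer-valued test data, so float sums are exact.
std::vector<float> make_data(std::size_t count, int seed) {
    std::vector<float> v(count);
    for (std::size_t i = 0; i < count; ++i)
        v[i] = static_cast<float>(static_cast<int>((i * 7 + seed) % 11) - 5);
    return v;
}

// Compares both mappings against naive einsum loops; returns the max abs error.
float max_abs_error(int N, int T, int C, int G, int P) {
    auto X1 = make_data(static_cast<std::size_t>(N) * T * G, 1);
    auto Y1 = make_data(static_cast<std::size_t>(N) * C * G, 2);
    auto X2 = make_data(static_cast<std::size_t>(N) * C * T, 3);
    auto Y2 = make_data(static_cast<std::size_t>(N) * C * P, 4);
    auto Z1 = einsum_ntg_ncg_nct(X1, Y1, N, T, C, G);
    auto Z2 = einsum_nct_ncp_ntp(X2, Y2, N, C, T, P);
    float err = 0.0f;
    for (int n = 0; n < N; ++n)
        for (int c = 0; c < C; ++c)
            for (int t = 0; t < T; ++t) {
                float ref = 0.0f;
                for (int g = 0; g < G; ++g)
                    ref += X1[(n * T + t) * G + g] * Y1[(n * C + c) * G + g];
                err = std::max(err, std::fabs(ref - Z1[(n * C + c) * T + t]));
            }
    for (int n = 0; n < N; ++n)
        for (int t = 0; t < T; ++t)
            for (int p = 0; p < P; ++p) {
                float ref = 0.0f;
                for (int c = 0; c < C; ++c)
                    ref += X2[(n * C + c) * T + t] * Y2[(n * C + c) * P + p];
                err = std::max(err, std::fabs(ref - Z2[(n * T + t) * P + p]));
            }
    return err;
}
```

If this reading is right, the same arguments should carry over to the real `cublasSgemmStridedBatched`, with `true`/`false` replaced by `CUBLAS_OP_T`/`CUBLAS_OP_N`, device pointers in place of the host vectors, and a handle plus `alpha`/`beta` pointers added. Calling `max_abs_error(2, 3, 4, 5, 6)` exercises both mappings with distinct dimension sizes, which helps catch swapped leading dimensions.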

Info about Einsum op: onnx/Operators.md at master · onnx/onnx · GitHub

I need guidance on using the cuBLAS library for batched matrix multiplication for the two ops above.

I have been reading the available references (cuBLAS :: CUDA Toolkit Documentation; Pro Tip: cuBLAS Strided Batched Matrix Multiply | NVIDIA Developer Blog), but I do not see how to apply them to the equations above.

Could you please assist me with this?

Thanks in advance,
Darshan C G

Robert_Crovella · February 19, 2021, 8:54pm
