Kernel-level cuBLAS

Daniel_Wong · April 28, 2021, 11:00pm

Hi, All

Is there any kernel-level cuBLAS API, that we can use at the warp or block level?

I want to run many matrix-matrix multiplications inside a GPU kernel (__global__) function, therefore, I need the API to invoke the cuBLAS from a thread/warp/block.

Thanks

mnicely · April 28, 2021, 11:34pm

Hi Daniel_Wong,

Quick answer is no, but we are working on a new library device side cuBLAS library that should be available through Math Library Early Access Program, later this year. Sign up for updates here CUDA Math Library Early Access Program | NVIDIA Developer

Just curious, what size matrices are you interested in?

In the meantime, you might want to check out CUTLASS to see if it can satisfy you needs. GitHub - NVIDIA/cutlass: CUDA Templates for Linear Algebra Subroutines

Daniel_Wong · April 29, 2021, 4:46pm

The GEMM size for each warp is around 10000x128x128 (MxNxK).

Thanks!

Topic		Replies	Views
Calling cuBLAS from device? GPU-Accelerated Libraries cublas	4	765	April 26, 2023
cuBLAS for lower-end GPUs GPU-Accelerated Libraries	1	573	May 20, 2016
CUBLAS matrix multiplication matrix size limited by GPU memory size CUDA Programming and Performance	8	3438	August 2, 2010
Is there any linear algebra library available for coding CUDA kernels? In developing of CUDA code, I CUDA Programming and Performance	5	1381	July 2, 2012
Using gcgemm from CuBLAS CUDA Programming and Performance	1	704	March 23, 2020
Question about cuBLAS library internal kernels and their memory location GPU-Accelerated Libraries	0	372	March 27, 2019
How does CuBLAS use Gpu multi-core? CUDA Programming and Performance	5	7716	February 6, 2011
Multiple Cublas functions on single GPU CUDA Programming and Performance	5	1702	August 8, 2010
Is there any device function for matrix operation? CUDA Programming and Performance	2	817	April 13, 2012
Optimizing Sequential cuBLAS Calls for Matrix Operations—Alternatives to Kernel Fusion? GPU-Accelerated Libraries cublas	3	387	April 29, 2024

Kernel-level cuBLAS

Related topics