Difference between the csrgemm() and csrgemm2()

GD_06 · March 13, 2016, 4:13am

Hi all,

I am applying cusparse function to my application recently to accelerate the SpGEMM. When I went through the documentation, I noted that there are two functions, csrgemm() and csrgemm2() to accomplish this task.

However, I am not quite understand any difference, especially in terms of performance, between this two functions. Did anyone do experiments and can explain it to me? I am quite grateful to your generous help.

Regards,
GD_06

Robert_Crovella · March 14, 2016, 11:59am

The difference is discussed in the documentation:

http://docs.nvidia.com/cuda/cusparse/index.html#cusparse-lt-t-gt-csrgemm2

“We provide csrgemm2 as a generalization of csrgemm. It provides more operations in terms of alpha and beta. For example, C = -A*B+D can be done by csrgemm2.”

So csrgemm2 does operations in a single library call that csrgemm cannot. csrgemm2 provides support for alpha and beta so that you can do this:

C = alpha ∗ A ∗ B + beta ∗ D

whereas csrgemm can only do this:

C = op ( A ) ∗ op ( B )

If you have an operation that can be realized using csrgemm, it’s unlikely to be faster using csrgemm2. It’s likely that csrgemm2 was added to the api after csrgemm, and so csrgemm was kept for compatibility reasons. It may also be the case that csrgemm is slightly faster if you don’t need an alpha and a beta.

GD_06 · March 17, 2016, 1:06pm

Thanks txbob!

Topic		Replies	Views
cusparse csrgemm2 and half precision GPU-Accelerated Libraries	4	558	September 10, 2019
cuSparse: cusparseScsrgemm2 much slower than SpGEMM GPU-Accelerated Libraries cusparse	3	1220	June 24, 2021
Difference between CuSparse csrsv* and csrsv2* GPU-Accelerated Libraries	4	2482	June 14, 2014
Slow cusparseDcsrgeam in CUDA 9.2 GPU-Accelerated Libraries	4	731	August 17, 2018
Significant performance decrease from cusparseZ(D)csrmm2 to cusparseSpMM GPU-Accelerated Libraries	2	400	February 1, 2021
CUDA_func & CU_func CUDA Programming and Performance	2	1952	April 21, 2008
cusparseScsrmv performance CUDA Programming and Performance	3	1858	January 4, 2011
CUDA 2.0 beta impressions CUDA Programming and Performance	8	4985	April 26, 2008
GPU cryptography speedup CUDA Programming and Performance	5	9170	October 17, 2008
What 'cusparseXcscsort' or 'cusparseXcsrsort' use for? GPU-Accelerated Libraries cuda , cusparse	4	647	June 30, 2023

Difference between the csrgemm() and csrgemm2()

Related topics