Efficient computation for sparse matrices (with CRS format)?

JaebeomPark · March 4, 2017, 3:17am

Hi,
I’m wondering how I can efficiently handle sparse matrices on the TX1.
This includes:

Parsing the CRS format in the low-level CUDA layer (or another format)
Computing the convolution of sparse and dense matrices with the best speed and minimum memory

So, my questions are:

Are there any high-level APIs for these?
If so, do open frameworks such as Caffe and TensorFlow support them?
Does this give a speed-up compared to dense-dense matrix computation?
(I have seen a few examples where sparse connections do not actually speed up inference time because of several overheads.)

AastaLLL · March 6, 2017, 5:08am

Hi,

Thanks for your question.

We have the cuSPARSE library, which contains basic linear algebra subroutines for handling sparse matrices.
Read more at: http://docs.nvidia.com/cuda/cusparse/index.html#ixzz4aWEDdaET
AFAIR, TensorFlow also supports sparse tensors:
https://www.tensorflow.org/versions/r0.11/api_docs/python/sparse_ops/
You can ask on their forum for more details.
The speed-up ratio should depend on matrix sparsity. This page could give you some idea about performance:
cuSPARSE | NVIDIA Developer
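
For readers unfamiliar with the CRS (also called CSR) layout mentioned in the question, here is a minimal sketch in plain Python of how the format stores a matrix and how a sparse x dense product walks it. This is only an illustration of the data structure, not cuSPARSE or TensorFlow code; the function names `dense_to_csr` and `csr_spmm` and the example matrices are made up for this sketch.

```python
# Illustrative only: dense_to_csr and csr_spmm are hypothetical helper names,
# not part of cuSPARSE, TensorFlow, or any other library.

def dense_to_csr(matrix):
    """Convert a list-of-lists matrix to CSR arrays (values, col_idx, row_ptr)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in matrix:
        for j, x in enumerate(row):
            if x != 0:
                values.append(x)   # nonzero value
                col_idx.append(j)  # its column index
        # row_ptr[i + 1] marks where row i ends inside values/col_idx
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_spmm(values, col_idx, row_ptr, dense):
    """Compute C = A @ B, with A given in CSR form and B a dense list-of-lists."""
    n_cols = len(dense[0])
    result = []
    for i in range(len(row_ptr) - 1):
        out_row = [0] * n_cols
        # Only the stored nonzeros of row i are visited.
        for k in range(row_ptr[i], row_ptr[i + 1]):
            a = values[k]
            b_row = dense[col_idx[k]]
            for j in range(n_cols):
                out_row[j] += a * b_row[j]
        result.append(out_row)
    return result


A = [[1, 0, 0, 2],
     [0, 0, 3, 0],
     [0, 0, 0, 0]]
B = [[1, 2], [3, 4], [5, 6], [7, 8]]

values, col_idx, row_ptr = dense_to_csr(A)
C = csr_spmm(values, col_idx, row_ptr, B)
print(values, col_idx, row_ptr)
print(C)
```

The inner loop runs once per stored nonzero, so the arithmetic work scales with the number of nonzeros rather than with the full matrix size. That is the intuition behind the speed-up depending on sparsity.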

JaebeomPark · March 8, 2017, 3:19am

Thank you, @AastaLLL.
Your information is very helpful!

Regarding the speed-up ratio in your link, I can’t find a performance comparison of Dense*Dense vs. Dense*Sparse on NVIDIA platforms.
Also, the performance was measured on a P100. Can I expect a similar effect on the TX1?

If available, could you please send me more details on Sparse*Dense performance?

AastaLLL · March 10, 2017, 8:14am

Hi,

Thanks for your reply.

The speed-up may depend on matrix sparsity.
We don’t have a Dense*Dense vs. Dense*Sparse comparison, or a TX1 version of these results.

Sorry about this.
But you can still get some hints from: cuSPARSE | NVIDIA Developer
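
As a rough way to see why sparsity matters, the sketch below counts multiply-add operations for a dense product and for a product that touches only the nonzeros. It is a back-of-the-envelope estimate, not a benchmark: the function name `multiply_add_counts` and the matrix sizes are made up for illustration, and the count ignores index lookups and irregular memory access, so real results on a TX1 may look quite different.

```python
# Hypothetical helper for illustration; this is an arithmetic estimate,
# not a measurement on any GPU.

def multiply_add_counts(m, k, n, density):
    """Return (dense_ops, sparse_ops) for an (m x k) @ (k x n) product.

    density is the assumed fraction of nonzero entries in the (m x k) matrix.
    """
    dense_ops = m * k * n          # every entry of A meets every column of B
    nnz = round(m * k * density)   # assumed number of stored nonzeros
    sparse_ops = nnz * n           # only nonzeros of A are multiplied
    return dense_ops, sparse_ops


for density in (0.5, 0.1, 0.01):
    dense_ops, sparse_ops = multiply_add_counts(1024, 1024, 1024, density)
    print(f"density={density}: dense={dense_ops}, sparse={sparse_ops}")
```

The operation count shrinks in proportion to the density, but the sparse path also pays for index handling and scattered reads, which is one reason a matrix often has to be quite sparse before the sparse routine actually wins.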
