cuSPARSE to solve multiple independent sparse linear systems in parallel

vincentroberge · March 1, 2014, 3:29pm

I have an application where I need to solve many power flow analysis which require solving many independent sparse linear systems. Each linear system have the same size and structure but different values in both the A matrix and B vector (let say we are solving A*X = B for X). Their size range from 30x30 to 3000x3000 with about 60 to 6000 non-zero elements mostly on the diagonal. I would like to use CUDA cuSPARSE to solve those linear systems on the GPU. If I had source code, I would use one thread block per linear system and my application would require 1000 threadblocks or so (for 1000 independent systems). The problem is that cusparse works at a higher level. I was thinking of two solutions:

solve each system sequentially on the GPU using different streams, hoping to get concurrent execution, but I doubt the performance will be good.
merge all my systems in a single large matrix and solve this matrix. This would create a very large matrix with all the original matrices positioned as blocks on the diagonal. Woud the performance be good?

Does anyone has a suggestion. Maybe there is another alternative. Maybe there are source code I could use and solve my system at the blockthread level? Any suggestions is appreciated. Thanks in advance.

CudaaduC · March 2, 2014, 6:29am

cuSparse(as far as I can tell) only has the A*X=B solver for sparse triangular matrices. There are two steps, one analysis and one solve.

[url]cuSPARSE :: CUDA Toolkit Documentation

As far as item number #2, this is something I am working on but for matrix-vector multiplications.
I am replicating matrices of the same size on the diagonal, and representing that block-diagonal matrix as sparse. Then I perform the matrix-vector multiply.

While I am using cuSparse more now, it is not easy to master.

Your matrices are small enough that you should consider cuBlas or even MAGMA.

MAGMA will be the best choice and but most solvers require you do a LU decomp or Cholesky factor first.

[url]http://icl.cs.utk.edu/projectsfiles/magma/docs/magma-v02.pdf[/url]

As far as streams go, they work but you need a high-end GPU and if the independent solves are not small, then it will not be able to parallelize the entire group.

vincentroberge · March 2, 2014, 6:14pm

Thank you, I really appreciate your comment. I will try both approaches and comments on the performance here.

CudaaduC · March 2, 2014, 9:20pm

Out of necessity I wrote a helper kernel which takes in a small dense matrix, and replicates across the diagonal in CSR sparse format.

Initially I tried creating a large block diag dense matrix(from the small matrices) then converting to CSR format using the cuSparse utilities. The problem was that the conversion process took too long so I wrote my own implementation which is much faster (but also takes advantage of the fact that I know the number of non-zeros ahead of time).

Let me know if you have any interest in that implementation. Really cuSparse is meant to handle very large but sparse data sets, so you probably will be better off using MAGMA, CULA or cuBlas.

CudaaduC · March 3, 2014, 1:45am

edit…

Topic		Replies	Views
cuSPARSE for solving Ax=b on matrix ~ 230400x230400 GPU-Accelerated Libraries	3	3679	December 31, 2015
Performance cusparseScsrsm_solve GPU-Accelerated Libraries	0	761	December 1, 2014
Use of CUSPARSE for AX=B CUDA Programming and Performance	11	7740	July 22, 2013
cusparse Ax=B? GPU-Accelerated Libraries	4	1239	February 28, 2014
Linear equations and sparse matrix CUBLAS, CUSPARSE, CULA CUDA Programming and Performance	2	2536	May 20, 2011
Example using cusparse and cusolverSpDcsrlsvchol GPU-Accelerated Libraries cusolver , cusparse	7	44	October 8, 2024
Banded sparse matrix linear eqution solve with CUDA GPU-Accelerated Libraries	0	2221	January 16, 2013
How to solve a system of linear equation(matrix of the order of 10000*100000) using CUBLAS GPU-Accelerated Libraries	3	1236	October 16, 2014
cublas solving a linear system GPU-Accelerated Libraries	4	2518	June 28, 2017
Cusparse for solving the sparse linear equation Ax=b Legacy PGI Compilers	8	1976	August 30, 2019

cuSPARSE to solve multiple independent sparse linear systems in parallel

Related topics