Linear Algebra Solvers

gerdw · March 20, 2007, 2:24pm

Does anyone have any plans to produce generic, optimized linear algebra solvers using CUDA (think of some of the solvers in IMSL)? For example, a tri-diagonal solver, a sparse matrix solver, or eigenvalues would be very useful to a large class of problems.

The CUBLAS library is a great step in the right direction. :)

Uliveto · August 2, 2007, 1:17pm

Yes I do agree, and I am searching for a CUDA implementation of a linear system solver… Did you do any progress? I’ll let you know mines

jachung · August 14, 2007, 12:45am

In fact, I am considering a development of a sparse matrix package. I am looking into SPARSE1.4 and UMFPACK5.1 that I am familiar with. I believe the fact that sparse matrix solver is available is pivotal to further exploitation of GPU hardware acceleration in scientific computing community.

HoKru · February 21, 2008, 8:25am

Just wanted to push the topic a bit.

Something like the LAPACK routines for linear equations are on top of my wish list…

(btw, thx for CUBLAS!)

greetings

pacodani · February 28, 2008, 11:36am

You can find some dense results at the CUDA showcase (Page Not Found | NVIDIA).

HoKru · February 29, 2008, 8:43am

Thx, I must have overseen some of the papers in the showcase

shong · November 18, 2008, 6:56am

do you have some idea about the sparse matrix iterative solver?

toddwz · January 22, 2009, 8:02pm

I am also interested in iterative solvers, especially Krylov subspace methods.

jack · January 22, 2009, 9:53pm

CUBLAS 2.0 should be released soon, which may have some more methods in it that could be used to build a linear algebra solver. As for specifically-optimized kernels (e.g. tridiagonal, sparse, etc.), I don’t think there is anything official coming from nVidia, but I know that some people are working on their own implementations of such kernels (like myself) – though I don’t think there’s anything out to the public yet.

Gregory_Diamos · January 22, 2009, 9:57pm

Shameless plug: http://gpu-vsipl.gtri.gatech.edu/

nitin.life · January 29, 2009, 2:46am

I am planing to make a CONJUGATE GRADIENT (krylov types) …(plus others solvers in CUDA . Can some one let me know if such kind of work has been done previously ?

Thanks all

erdooom · January 29, 2009, 8:00am

Yes there have been a few trys in that direction. usually the tricky bit is the vector matrix multiply and the large vector dot product. Both of which have nice implementations in the cuda demos and in cudpp. We have our own internal solver written completely in cuda with a very nice preference boost. So it can work. though i can’t share it with you guys … sorry :)

nitin.life · January 30, 2009, 11:57pm

okay thanks for that info External Image … can you throw some light on the parallel strategy you used or is that also restricted ? External Image External Image

dneckels · February 2, 2009, 9:13pm

Hi,

I just finished up my GMRES solver this weekend.

Its very fast, but suffers the obvious problem: no preconditioners!!! This is the next
step, but I’m not sure what type of preconditioners I might be able to compute quickly enough (using, of course,
the GPU). My favorite is ILUT, but can this be thread parallelized???

Let me know, I have no problem providing the code (I think you can email using the board, eh?)

This would be a great project that I would like to collaborate on; iterative solvers and preconditioners. Sort of the Aztecoo of CUDA.

mfatica · February 2, 2009, 9:29pm

You could attach the code to your post.
Nice picture, is that a separation in a diffuser?

dneckels · February 2, 2009, 9:58pm

GMRES code:

I created a google code project and added the solver.
Its ugly, but it works and might be a start to something more???

It can be accessed by svn:

Non-members may check out a read-only working copy anonymously over HTTP.

svn checkout [url=“http://cudaztec.googlecode.com/svn/trunk/”]http://cudaztec.googlecode.com/svn/trunk/[/url] cudaztec-read-only

If anyone wants, I would be happy to add them and make this into something.

Let me know if there are problems with the svn checkout.

applehu · February 2, 2009, 11:11pm

Thank you very much for sharing the code. Could you please give some performance data of the code?

Thanks!

dneckels · February 2, 2009, 11:19pm

Again, the code is very rough for the moment!

I don’t have time to generate a lot of performance data; perhaps you could contribute some? I’d be happy to post results at the google code site!

My preliminary tests (and the code will go through this) is that the matrix multiply is (on my GeForge 9800M card) 369 times faster than the cpu. Hence iteration proceeds very quickly.

The downside is that without preconditioning, there isn’t yet a win, because it takes so many iterations to solve anything!!

My hope is to generate some interest and maybe get some people helping out and writing some preconditioners???

And to talk about some design ideas…

So, please, let me know what you think!

applehu · February 2, 2009, 11:48pm

Thank you for the info. The matrix I’m working on is relatively small and dense. So I’m still trying to figure out if I should focus on direct solver or try iterative solver. The speedup you showed is quite impressive and you are right that the preconditioner will be very important.

dneckels · February 3, 2009, 4:14pm

I’ve updated the code to be slightly less ugly, including checks for convergence, max iters, etc…
I also uploaded a smaller matrix for testing. On to working on ILUT…

http://cudaztec.googlecode.com

(cuda GMRES solver)

Topic		Replies	Views
CULA Sparse 1.0 now available Sparse Iterative Solver Package CUDA Programming and Performance	3	2276	November 7, 2011
CULA Sparse (beta) taking applications gpu-accelerated sparse linear algebra CUDA Programming and Performance	1	2312	August 26, 2011
CULA Sparse (beta) taking applications gpu-accelerated sparse linear algebra CUDA Programming and Performance	1	734	August 26, 2011
Solutions of linear equations CUDA Programming and Performance	7	5479	July 16, 2010
Is there library or code to solve Large Sparse Linear system or do Large sparse Matrix Multiplicatio CUDA Programming and Performance	4	3361	April 28, 2009
Linear equations and sparse matrix CUBLAS, CUSPARSE, CULA CUDA Programming and Performance	2	2576	May 20, 2011
Sparse Matrix solvers with CUDA Do these solvers exist? CUDA Programming and Performance	8	32504	October 15, 2010
Complex Sparse Matrix Linear Solver CUDA Programming and Performance	0	2303	December 13, 2008
Cusp v0.1 release (Sparse Matrix Library) Cusp is a high-level library for sparse linear algebra and CUDA Programming and Performance	0	1512	May 4, 2010
Parallel Preconditioners for CG calculating the "inverse" in parallel CUDA Programming and Performance	2	3941	April 7, 2010

Linear Algebra Solvers

Non-members may check out a read-only working copy anonymously over HTTP.

Related topics