I currently develop a process modelling, simulation and optimization software and I started porting SuperLU direct sparse linear equations solver to CUDA. It works for certain cases but there are few issues with synchronization between threads. I am looking for some people good in CUDA programming interested to help in resolving the issues so I guess this is a good place to ask for it. More information and the source can be found at DAE Tools project SuperLU_CUDA port.
Cheers