cuSolverSp_LowlevelCholesky fails with CUSOLVER_STATUS_INTERNAL_ERROR

openfresco · March 8, 2020, 4:29am

I tested the CUDA 10.2 sample cuSolverSp_LowlevelCholesky (\CUDA Samples\v10.2\7_CUDALibraries\cuSolverSp_LowlevelCholesky) on a GeForce GTX Titan X with 12288 MB of memory under Windows 10. I tried Visual Studio 2013 and 2019 and saw the same behavior with both.

With lap2D_5pt_n100.mtx, the matrix that ships with the sample (10000x10000, 29800 non-zero entries in the file, file size about 402 kB), the example runs fine and produces the following output.
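For reference, the file holds 29800 entries while the log below reports 49600 nonzeros. The two figures agree if the file stores only one triangle of the symmetric matrix and the reader expands it to the full matrix; that is my assumption, not something I checked in the sample source. A minimal sketch of the arithmetic for a 5-point Laplacian on an n x n grid (the function names are mine, purely for illustration):

```cpp
#include <cstdint>

// Entries stored when only one triangle (plus the diagonal) of the
// 5-point Laplacian on an n x n grid is kept. Assumes one diagonal entry
// per grid point and one off-diagonal entry per pair of adjacent points.
std::int64_t storedEntries(std::int64_t n) {
    const std::int64_t diagonal = n * n;             // one per grid point
    const std::int64_t neighborPairs = 2 * n * (n - 1); // horizontal + vertical pairs
    return diagonal + neighborPairs;
}

// Nonzeros after expanding to the full symmetric matrix:
// every off-diagonal entry appears twice.
std::int64_t expandedNonzeros(std::int64_t n) {
    const std::int64_t diagonal = n * n;
    const std::int64_t neighborPairs = 2 * n * (n - 1);
    return diagonal + 2 * neighborPairs;
}
```

For n = 100 this gives 29800 stored entries and 49600 expanded nonzeros, matching both numbers.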

GPU Device 0: "Maxwell" with compute capability 5.2

Using default input file [./lap2D_5pt_n100.mtx]
step 1: read matrix market format
sparse matrix A is 10000 x 10000 with 49600 nonzeros, base=1
step 2: create opaque info structure
step 3: analyze chol(A) to know structure of L
step 4: workspace for chol(A)
step 5: compute A = L*L^T
step 6: check if the matrix is singular
step 7: solve Ax = b
step 8: evaluate residual r = b - Ax (result on CPU)
(CPU) |b - Ax| = 3.637979E-012
(CPU) |A| = 8.000000E+000
(CPU) |x| = 7.513384E+002
(CPU) |b - Ax|/(|A|*|x|) = 6.052497E-016
step 9: create opaque info structure
step 10: analyze chol(A) to know structure of L
step 11: workspace for chol(A)
step 12: compute A = L*L^T
step 13: check if the matrix is singular
step 14: solve Ax = b
(GPU) |b - Ax| = 1.364242E-012
(GPU) |b - Ax|/(|A|*|x|) = 2.269686E-016
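
As a sanity check on these numbers, the last line of each block is the residual norm divided by the product of the other two norms: 3.637979E-012 / (8 * 7.513384E+002) is about 6.0525E-016. The value |A| = 8 is consistent with an infinity norm (largest absolute row sum) of a 5-point Laplacian, so the sketch below assumes infinity norms throughout; I have not confirmed that against the sample source, and the helper names are mine.

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Infinity norm of a vector: largest absolute entry.
double vecNormInf(const std::vector<double>& v) {
    double norm = 0.0;
    for (double value : v) norm = std::max(norm, std::fabs(value));
    return norm;
}

// Infinity norm of a CSR matrix: largest absolute row sum.
// rowPtr holds zero-based offsets into values.
double csrNormInf(const std::vector<int>& rowPtr,
                  const std::vector<double>& values) {
    double norm = 0.0;
    for (std::size_t row = 0; row + 1 < rowPtr.size(); ++row) {
        double rowSum = 0.0;
        for (int k = rowPtr[row]; k < rowPtr[row + 1]; ++k)
            rowSum += std::fabs(values[k]);
        norm = std::max(norm, rowSum);
    }
    return norm;
}

// Residual r = b - A*x for a zero-based CSR matrix.
std::vector<double> residual(const std::vector<int>& rowPtr,
                             const std::vector<int>& colInd,
                             const std::vector<double>& values,
                             const std::vector<double>& x,
                             const std::vector<double>& b) {
    std::vector<double> r(b);
    for (std::size_t row = 0; row + 1 < rowPtr.size(); ++row)
        for (int k = rowPtr[row]; k < rowPtr[row + 1]; ++k)
            r[row] -= values[k] * x[colInd[k]];
    return r;
}

// Scaled residual |r| / (|A| * |x|), the quantity on the last log line.
double scaledResidual(double rNorm, double aNorm, double xNorm) {
    return rNorm / (aNorm * xNorm);
}
```

Feeding the CPU norms from the log into scaledResidual reproduces the reported 6.052497E-016 to the printed precision, and the GPU line works out the same way with |b - Ax| = 1.364242E-012.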

However, when I run the same executable on two other matrices, Matrix 1 (138507x138507, 6155289 non-zero entries, file size about 138 MB) and Matrix 2 (109872x109872, 2311992 non-zero entries, file size about 42 MB), the program terminates with CUSOLVER_STATUS_INTERNAL_ERROR.

The matrix files are available at the following link for your reference.

The following output is for Matrix 1; the output for Matrix 2 is similar.

GPU Device 0: "Maxwell" with compute capability 5.2

Using default input file [./lap2D_5pt_n100.mtx]
step 1: read matrix market format
sparse matrix A is 138507 x 138507 with 6155289 nonzeros, base=0
step 2: create opaque info structure
step 3: analyze chol(A) to know structure of L
step 4: workspace for chol(A)
step 5: compute A = LL^T
step 6: check if the matrix is singular
step 7: solve Ax = b
step 8: evaluate residual r = b - Ax (result on CPU)
(CPU) |b - Ax| = 4.176400E-009
(CPU) |A| = 2.373223E+015
(CPU) |x| = 5.219332E-008
(CPU) |b - Ax|/(|A|*|x|) = 3.371697E-017
step 9: create opaque info structure
step 10: analyze chol(A) to know structure of L
step 11: workspace for chol(A)
step 12: compute A = L*L^T
step 13: check if the matrix is singular
CUDA error at cuSolverSp_LowlevelCholesky.cpp:327 code=7(CUSOLVER_STATUS_INTERNAL_ERROR) "cusolverSpDcsrcholZeroPivot( cusolverSpH, d_info, tol, &singularity)"

Could you please advise how to debug and resolve this error?
