The meaning of "machine accuracy"

spraesi · March 21, 2025, 5:56pm

Hello.
I am using the cuSOLVER eigen-decomposition function cusolverDnCheevjBatched, and I see the documentation says the default value for the tolerance is “machine accuracy”.

I am not sure exactly what that means, and since there is no function to read it but only a cusolverDnXsyevjSetTolerance to set it, I cannot know for sure.
Does anyone have an idea of what the machine accuracy is? I assume it’s probably 2^-24 for single precision.

christophk · March 21, 2025, 6:04pm

Hi Spraesi,

may I ask why you use CheevjBatched and not cusolverDnXsyevBatched? What is your typical input matrix size?

spraesi · March 21, 2025, 6:35pm

It’s just because I am targeting CUDA 11.8 (October 2022) and above.
I call CheevjBatched to speed up computations in MATLAB R2024b, which supports CUDA 12.2 “natively”, and the input is a 351x351x24 tensor.

Seems cusolverDnXsyevBatched was added quite recently with CUDA 12.6.2 (October 2024), but I am really curious if it is faster than CheevjBatched and to see if it does not have the issue mentioned in syevjBatched cannot be run asynchronously (because it seems I can at least pin the host-side workspace memory).
I just haven’t had time to check it out yet, and I’ll proably wait till a MATLAB release, where it is “natively” supported, so I do not have to compile statically with cuSOLVER (to avoid conflicting libraries) to use it.
Should I expect a speed-up using cusolverDnXsyevBatched over CheevjBatched?

spraesi · March 26, 2025, 12:49pm

In the documentation for syevj, it states

If the user sets an improper tolerance, syevj may not converge. For example, tolerance should not be smaller than machine accuracy.

And in the corresponding code example, it says in a comment “default value of tolerance is machine zero”.

So this implies the default tolerance value is some non-zero value, probably 2^-23 for floats.

The new API cusolverDnXsyevBatched does not support setting the tolerance, which is also a shame since increasing the tolerance greatly improved the speed without reducing the quality in my case.

Topic		Replies	Views
cuSolver big deviations from lapack GPU-Accelerated Libraries	5	2515	May 1, 2017
Syevj performance GPU-Accelerated Libraries cusolver	3	1089	May 20, 2021
CUSOLVER_STATUS_NOT_INITIALIZED at cusolverDnCheevjBatched_bufferSize GPU-Accelerated Libraries cuda	0	601	June 15, 2023
Gesvdj vs gesvda vs syevj for finding eigenvalues/singular values GPU-Accelerated Libraries cusolver	2	981	July 16, 2025
How to run eigen decomposition of multiple Hermitian matrices in parallel? GPU-Accelerated Libraries cusolver	0	1149	March 4, 2022
Cusolver syevd samples has different output from official GPU-Accelerated Libraries cusolver	2	468	December 16, 2022
cusolver: no batched version of 'cusolverDnSsyevd' library to solve large small symmetric eigenvalue and eigenvector GPU-Accelerated Libraries	0	440	June 19, 2017
Batched SVD with Cusolver routine cusolverDn<t>gesvdjBatched - how to use ? GPU-Accelerated Libraries	1	1279	July 21, 2018
Help Improving Performance using cuSolver/cuSparse Routines GPU-Accelerated Libraries cuda , nsight , performance , python , pycuda	0	716	December 15, 2023
Limitations of cusolverDn<t>syevd() GPU-Accelerated Libraries	1	470	January 30, 2025

The meaning of "machine accuracy"

Related topics