Linear regression using CUDA

Hello CUDA programmers,

Is there a library to get regression coefficients (beta hat) of matrix X, y at once using CUDA library?

So far, I have found three components to calculate the coefficient : matrix multiplication (matmul), matrix inversion (cusolver) and matrix transposition.

It seems overwhelming to combine these tree components