Confusion in bached Cholesky Factorization.

czhm13 · April 2, 2019, 2:04pm

Hi, I am reviewing the bached Cholesky Examples (E.1 bached Cholesky Fracorization) in cuSOLVER. And I am little confused about part of code.

The code blow declare 2 host arrays A0 and A1 along with a *Aarray. I assume *Aarray is located in the device since it is malloced using cudaMalloc.

double A0[lda*m] = { 1.0, 2.0, 3.0, 2.0, 5.0, 5.0, 3.0, 5.0, 12.0 };
    double A1[lda*m] = { 1.0, 2.0, 3.0, 2.0, 4.0, 5.0, 3.0, 5.0, 12.0 };

    double *Aarray[batchSize];

    for(int j = 0 ; j < batchSize ; j++){
        cudaStat1 = cudaMalloc ((void**)&Aarray[j], sizeof(double) * lda * m);
    }

cudaStat1 = cudaMemcpy(Aarray[0], A0, sizeof(double) * lda * m, cudaMemcpyHostToDevice);
    cudaStat2 = cudaMemcpy(Aarray[1], A1, sizeof(double) * lda * m, cudaMemcpyHostToDevice);

The next part is where it confuse me, the d_Aarray is copied from Aarray using cudaMemcpyHostToDevice flag which means Aarray is located in the host, which is contradicted to the content above.

cudaStat1 = cudaMemcpy(d_Aarray, Aarray, sizeof(double*)*batchSize, cudaMemcpyHostToDevice);

Could someone tells me where is the disconnect here? Very appreciated!

Robert_Crovella · April 2, 2019, 4:24pm

I assume the answer here will clear it up:

[url]https://devtalk.nvidia.com/default/topic/1049419/gpu-accelerated-libraries/way-to-covert-pointer-d_a-to-array-d_array-/[/url]

Topic		Replies	Views
Way to covert pointer (d_A) to array (d_Array []) GPU-Accelerated Libraries	4	592	April 3, 2019
allocating double pointer memory in GPU CUDA Programming and Performance	3	11762	February 3, 2011
need some help with cudaMemcpy/cudamemcpy2D CUDA Programming and Performance	2	1980	June 9, 2010
cudaMalloc causes segmentation fault 2 Mo is far from my 1,2 Go card memory limit CUDA Programming and Performance	7	7469	June 28, 2011
cudaErrorInvalidValue when copying from host to device CUDA Programming and Performance	1	9817	November 24, 2009
cublas_cublasDgetrsBatched_problem GPU-Accelerated Libraries cublas , cusolver	5	931	October 6, 2022
Multi-GPU array CUDA Programming and Performance	2	572	June 4, 2021
How to use cudaMalloc3DArray to copy dynamic 3d array from host to device? CUDA Programming and Performance	2	3070	September 2, 2011
Using unified memory for 2Dim and 3 Dim array CUDA Programming and Performance	2	529	November 18, 2018
Copying to 2D cuda array source array is smaller than allocated one CUDA Programming and Performance	5	5440	April 17, 2009

Confusion in bached Cholesky Factorization.

Related topics