cudaMallocManaged does not allocate in shared VRAM, but in dedicated VRAM

Ahessian · December 16, 2020, 9:05am

Although this post says that we have to call cudaMallocManaged to allocate in shared VRAM:
Unified Memory for CUDA Beginners | NVIDIA Developer Blog
it is cudaMallocHost that actually does the job.

cudaMallocManaged :

cudaMallocHost :

I have a GTX 1060 3 GB

rreddy78 · December 18, 2020, 6:46am

I don't think it works that way. By default, managed memory is attached to GPU VRAM and is copied to the host when required.
But you can also attach it to the host initially if you like. Here is some code from an application note.

cudaHostAlloc allocates pinned, zero-copy host memory that is accessible from the GPU.

 void MatrixMul(int hp, int hq, int wp, int wq)
 {
     int *p, *q, *r;
     // Cast before multiplying so the size is computed in size_t, not int
     size_t sizeP = (size_t)hp * wp * sizeof(int);
     size_t sizeQ = (size_t)hq * wq * sizeof(int);
     size_t sizeR = (size_t)hp * wq * sizeof(int);

     // Attach buffers 'p' and 'q' to the CPU and buffer 'r' to the GPU
     cudaMallocManaged(&p, sizeP, cudaMemAttachHost);
     cudaMallocManaged(&q, sizeQ, cudaMemAttachHost);
     cudaMallocManaged(&r, sizeR);

     // Initialize with random values (randFill is defined elsewhere)
     randFill(p, q, hp, wp, hq, wq);

     // Attach 'p' and 'q' globally so the GPU can use them in the computation
     cudaStreamAttachMemAsync(NULL, p, 0, cudaMemAttachGlobal);
     cudaStreamAttachMemAsync(NULL, q, 0, cudaMemAttachGlobal);

     // Launch configuration is elided in the original
     matrixMul<<<....>>>(p, q, r, hp, hq, wp, wq);

     // Attach 'r' back to the host, as only 'r' is needed on the CPU
     cudaStreamAttachMemAsync(NULL, r, 0, cudaMemAttachHost);
     cudaStreamSynchronize(NULL);

     // Release the managed buffers once the result has been consumed
     cudaFree(p);
     cudaFree(q);
     cudaFree(r);
 }
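
For comparison, here is a minimal sketch of the zero-copy route mentioned above. cudaHostAlloc with the cudaHostAllocMapped flag keeps the buffer in pinned system RAM and lets the kernel read and write it directly, which would be consistent with the first post's observation that cudaMallocHost lands in shared rather than dedicated memory. The kernel scaleByTwo and the helper runZeroCopyDemo are names made up for this illustration, and the sketch assumes a device that supports mapped host memory (older setups may need cudaSetDeviceFlags(cudaDeviceMapHost) before any other CUDA call).

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration: doubles each element in place.
__global__ void scaleByTwo(int *data, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        data[i] *= 2;
    }
}

// Hypothetical helper: returns the number of wrong elements, or -1 on a CUDA error.
int runZeroCopyDemo(int n)
{
    int *hostPtr = NULL;
    int *devPtr = NULL;
    size_t bytes = (size_t)n * sizeof(int);

    // Pinned host allocation that is also mapped into the device address space.
    if (cudaHostAlloc((void **)&hostPtr, bytes, cudaHostAllocMapped) != cudaSuccess) {
        return -1;
    }

    for (int i = 0; i < n; ++i) {
        hostPtr[i] = i;
    }

    // Device-side pointer that refers to the same physical host memory.
    if (cudaHostGetDevicePointer((void **)&devPtr, hostPtr, 0) != cudaSuccess) {
        cudaFreeHost(hostPtr);
        return -1;
    }

    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    scaleByTwo<<<blocks, threads>>>(devPtr, n);

    // Wait for the kernel; its writes went straight to host memory, so no memcpy is needed.
    if (cudaDeviceSynchronize() != cudaSuccess) {
        cudaFreeHost(hostPtr);
        return -1;
    }

    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (hostPtr[i] != 2 * i) {
            ++mismatches;
        }
    }

    cudaFreeHost(hostPtr);
    return mismatches;
}

int main()
{
    int mismatches = runZeroCopyDemo(1 << 20);
    printf("mismatches: %d\n", mismatches);
    return mismatches == 0 ? 0 : 1;
}
```

The trade-off is that every GPU access to such a buffer goes over the bus, so this suits data that is touched once or streamed, whereas managed memory lets the driver move the data into VRAM.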
