CudaMallocAsync-cudaFreeAsync

CisMine · August 21, 2023, 6:17pm

in cuda code we have cudaMallocAsync and cudaFreeAsync which mean we dont need to allocate the size equal of the data size, we can decrease the size right ( reuse the data )
for example we got array 10 element but we can allocate the size with just 5 element and reuse it
can we?

Robert_Crovella · August 21, 2023, 7:14pm

You can always reuse an allocation (as long as you haven’t freed it.) Beyond that, I’m not sure what you mean by

If you allocate space for 5 elements, then you can store the first half of the 10 element array there, and later on store the second have of the 10 element array there.

You can do the same thing with cudaMalloc. So I may not be grasping your question.

CisMine · August 21, 2023, 7:53pm

For example, I have an array with 100 elements in cpu, I need to copy that array to GPU for computing something.

In normal way when use cudaMalloc, there’re 2 ways:

allocate 100elements once
allocate 50elements for computing and copy H2D then reuse that space to allocate 50 rest.

In second way, (not compare the speed of these ways) we just need to spend 50 * sizeof(int) space which is better than 100 * sizeof(int) space ( in first way) BUT in both ways we need to take 2 time unit to allocating 100elements ( example allocate 50 elements need 1 time unit)

So we can improve the second way by using cudaMallocAsync which mean we overlap the time to allocate elements. I think I just answer my question. Thks

Topic		Replies	Views
Asynchronous cudaMalloc CUDA Programming and Performance	3	11882	July 2, 2012
Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 1 Technical Blog	1	714	September 13, 2024
Asynchronous cudaMallocFree/cudaFreeAsync per GPU? CUDA Programming and Performance	1	64	February 3, 2025
Using the NVIDIA CUDA Stream-Ordered Memory Allocator, Part 2 Technical Blog	12	1416	September 12, 2023
The impact of cudaMalloc(）and cudaFree() on the overlapping of kernel executions and data transfer CUDA Programming and Performance	0	1021	July 22, 2020
Asynchronous problem with cudaMalloc CUDA Programming and Performance	2	1043	May 22, 2023
cudaStream alloc after free result in oom CUDA Programming and Performance	8	166	January 1, 2025
Can cudaFreeAsync be used to free unified memory allocated with cudaMallocManaged? CUDA Programming and Performance cuda	2	59	April 26, 2025
Why is there no `cudaMallocArrayAsync`? CUDA Programming and Performance cuda	1	98	April 14, 2025
cudaHostAlloc memory initial time CUDA Programming and Performance	0	388	August 19, 2018

CudaMallocAsync-cudaFreeAsync

Related topics