Is cudaMalloc slow when called multiple times?

bjfujsy · July 5, 2024, 4:21am

Hi, I have an application that needs to call cudaMalloc many times. At runtime, I found that a few of these calls are extremely slow (about 200 ms each), which drags down the overall run time. As far as I know, this can happen on the first call, but why does it happen in the middle of a run?

NVIDIA L20
Driver Version: 550.54.15
CUDA Version: 12.4

Curefab · July 5, 2024, 8:44am

Hi bjfujsy,
Calling cudaMalloc in the performance-critical part of a program is generally not advised. Try to allocate all the memory up front. However, 200 ms is quite a lot. Is your memory nearly full, or are you allocating a huge amount of memory?
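
To illustrate the "allocate up front" advice, here is a minimal sketch of one way to do it: a single cudaMalloc at startup, followed by pointer-bump sub-allocations that never touch the driver. The `DeviceArena` type and its method names are hypothetical (they are not part of CUDA or of the original poster's code), and the 256-byte alignment is an assumption chosen to be conservative.

```cuda
#include <cuda_runtime.h>
#include <cstddef>

// Hypothetical helper, not a CUDA API: reserve device memory once,
// then hand out slices of it without further cudaMalloc calls.
struct DeviceArena {
    char*       base     = nullptr;  // start of the single device allocation
    std::size_t capacity = 0;        // total bytes reserved
    std::size_t offset   = 0;        // bytes handed out so far

    // Reserve the whole arena once, outside the hot path.
    // Returns false if cudaMalloc fails.
    bool init(std::size_t bytes) {
        void* p = nullptr;
        if (cudaMalloc(&p, bytes) != cudaSuccess) return false;
        base     = static_cast<char*>(p);
        capacity = bytes;
        offset   = 0;
        return true;
    }

    // Return an aligned slice of the arena, or nullptr if it does not fit.
    // The 256-byte alignment is an assumption, not a requirement from the thread.
    void* alloc(std::size_t bytes) {
        const std::size_t align = 256;
        const std::size_t start = (offset + align - 1) / align * align;
        if (base == nullptr || start > capacity || bytes > capacity - start) {
            return nullptr;
        }
        offset = start + bytes;
        return base + start;
    }

    // Make the whole arena reusable (for example once per iteration).
    void reset() { offset = 0; }

    // Release the device memory at shutdown.
    void destroy() {
        if (base != nullptr) cudaFree(base);
        base     = nullptr;
        capacity = 0;
        offset   = 0;
    }
};
```

In use, you would call `init` once during startup with the total size you expect to need, call `alloc` wherever the code currently calls cudaMalloc, and call `reset` when the buffers from one iteration are no longer needed. This only fits workloads whose buffers can be released all at once; it is not a general-purpose allocator.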

bjfujsy · July 5, 2024, 10:11am

Hi, there is plenty of free memory, and each allocation is about 3.5M in size. I don't think that is huge. Is there any other possible reason?

cbuchner1 · July 5, 2024, 10:25am

Speed also depends heavily on the type of memory allocated: device-only, pinned, managed, and so on.

Speed is also dependent on the operating system, and whether or not the card is running in TCC mode on Windows.

And finally, when cudaMalloc is called for the first time, a CUDA context may not exist yet. The call then triggers creation of the CUDA context, which is a very slow operation (hundreds of ms).
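
To narrow down which of these factors applies, it may help to measure them separately. The sketch below is only an illustration: `warm_up_context`, `MemKind`, and `time_alloc_ms` are hypothetical names, not CUDA APIs. It uses the common `cudaFree(0)` idiom to force context creation before any timing starts, then times a single allocation of a given memory type with a host clock.

```cuda
#include <cuda_runtime.h>
#include <chrono>
#include <cstddef>

// Force CUDA context creation up front so its cost is not charged
// to the first real allocation. cudaFree(0) frees nothing.
bool warm_up_context() {
    return cudaFree(0) == cudaSuccess;
}

// Hypothetical label for the allocation types mentioned above.
enum class MemKind { Device, Pinned, Managed };

// Time one allocation of `bytes` in milliseconds (host wall clock).
// Returns a negative value if the allocation fails.
double time_alloc_ms(MemKind kind, std::size_t bytes) {
    void* p = nullptr;

    // Wait for earlier asynchronous GPU work so that it is not
    // attributed to the allocation being measured.
    cudaDeviceSynchronize();

    const auto t0 = std::chrono::steady_clock::now();
    cudaError_t err = cudaSuccess;
    switch (kind) {
        case MemKind::Device:  err = cudaMalloc(&p, bytes);        break;
        case MemKind::Pinned:  err = cudaMallocHost(&p, bytes);    break;
        case MemKind::Managed: err = cudaMallocManaged(&p, bytes); break;
    }
    const auto t1 = std::chrono::steady_clock::now();

    if (err != cudaSuccess) return -1.0;

    // Pinned host memory has its own free function.
    if (kind == MemKind::Pinned) {
        cudaFreeHost(p);
    } else {
        cudaFree(p);
    }
    return std::chrono::duration<double, std::milli>(t1 - t0).count();
}
```

Calling `warm_up_context()` once at startup and then logging `time_alloc_ms` for each memory type at the sizes the application uses should show whether the slow calls are tied to context creation, to a particular memory type, or to neither. If the 200 ms spikes still appear after the warm-up, context creation is unlikely to be the cause.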
