I want to measure computing time, and I have run into a problem.
My experiment results are as follows.
Before executing the kernel function, I call cudaMalloc() several times.
When I call cutStartTimer(timer) before the first cudaMalloc(), the measured time is extremely long (120 ms).
When I move cutStartTimer(timer) to after the first cudaMalloc(), the time is very short (1.33 ms).
I don't understand why the first cudaMalloc() takes so long (120 ms vs. 1.33 ms)?
Thanks for any info :blink:
Dynamic memory allocation is very expensive
Is cudaMalloc() the first cuda* call in your program? The first such call also initializes the CUDA runtime/driver and the GPU to prepare them for CUDA calculations. That initialization takes a significant amount of time.
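A common way to check this is to "warm up" the context with a dummy call before starting the timer, so initialization is not charged to the allocation. Here is a minimal sketch using CUDA events instead of the cutil timer (the buffer size is arbitrary):

```cuda
#include <cuda_runtime.h>
#include <stdio.h>

int main(void) {
    // Dummy call: forces CUDA context creation here, so the
    // ~100 ms startup cost is paid before any timing begins.
    cudaFree(0);

    cudaEvent_t start, stop;
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Time the first "real" cudaMalloc() on its own.
    float *d_buf;
    cudaEventRecord(start, 0);
    cudaMalloc((void**)&d_buf, 1000 * sizeof(float));
    cudaEventRecord(stop, 0);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("cudaMalloc took %f ms\n", ms);

    cudaFree(d_buf);
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    return 0;
}
```

With the warm-up call in place, the first cudaMalloc() should time close to the later ones.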
So, is there any way to avoid the dynamic memory allocation overhead?
Thanks for reply :rolleyes:
Are there static memory allocation methods?
Thank you for reply.
Only allocate once at the beginning of the program.
Of course. Just declare a device array. You will need to use cudaMemcpyToSymbol to copy to it.
Sure. Here’s one:
float myStaticMemory[1000];
Extend it to CUDA as per what MrAnderson said.
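Putting those two replies together, a minimal sketch might look like this (the array name and size are just placeholders):

```cuda
#include <cuda_runtime.h>

// Statically declared device array -- no cudaMalloc() needed.
__device__ float myStaticMemory[1000];

int main(void) {
    float host[1000];
    for (int i = 0; i < 1000; ++i) host[i] = (float)i;

    // Copy host data into the static device array by symbol.
    cudaMemcpyToSymbol(myStaticMemory, host, sizeof(host));

    // ... launch kernels that read/write myStaticMemory ...

    // Copy results back the same way.
    cudaMemcpyFromSymbol(host, myStaticMemory, sizeof(host));
    return 0;
}
```

The trade-off is that the size must be known at compile time, unlike with cudaMalloc().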
There’s also lightweight dynamic allocation. E.g., you can implement your own stack allocator:
float stackmemory[250000];   // backing store, sized in floats
int stackpointer = 0;

float* myMalloc(int size) {  // size is in floats, not bytes
    float* pointer = &stackmemory[stackpointer];
    stackpointer += size;
    return pointer;
}

void myFree(int size) {      // frees must come in reverse (LIFO) order
    stackpointer -= size;
}

// EXAMPLE USE
for (int i = 0; i < 100; i += 1) {
    float* a = myMalloc(i);
    float* b = myMalloc(10 * i);
    // use a and b
    myFree(10 * i + i);      // pops b and a together
}
The above code will be much faster than calling malloc() repeatedly. The same concept extends to cudaMalloc() as well.
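For the cudaMalloc() case, that extension might look like the sketch below: one big device allocation up front, then cheap pointer-bump sub-allocations from it. The pool* names are hypothetical, and frees must be LIFO just like the stack above:

```cuda
#include <cuda_runtime.h>
#include <stddef.h>

static char  *d_pool = 0;     // base of the one-time device allocation
static size_t poolOffset = 0; // current bump-pointer position in bytes

void poolInit(size_t bytes) {
    cudaMalloc((void**)&d_pool, bytes);  // the only real cudaMalloc()
    poolOffset = 0;
}

// Round requests up to 256 bytes to keep device pointers aligned.
static size_t roundUp(size_t bytes) {
    return (bytes + 255) & ~(size_t)255;
}

void *poolMalloc(size_t bytes) {
    void *p = d_pool + poolOffset;
    poolOffset += roundUp(bytes);
    return p;
}

void poolFree(size_t bytes) {            // LIFO only
    poolOffset -= roundUp(bytes);
}

void poolDestroy(void) {
    cudaFree(d_pool);
    d_pool = 0;
}
```

Each poolMalloc() is just pointer arithmetic on the host, so per-iteration allocation cost drops to essentially nothing.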