Reduction of kernel's execution time that does not make sense

evabasis98 · January 11, 2018, 8:53pm

Hello!

I have 2 kernels, kernel1 and kernel2. kernel1 's time is 88.76 ms and kernel2 's time is 4.78 ms. But when kernel1 runs exactly before kernel2, then kernel2 's time goes to 2.56 ms.
Why is this happening? The two kernels uses differents arrays to execute.
Is there any idea?

Thank you in advance!

Robert_Crovella · January 11, 2018, 8:58pm

caching

managed memory effect

incorrect measurement

code defect

some aspect of cuda startup overhead, such as jit compilation

evabasis98 · January 11, 2018, 9:15pm

Hi txbob!! Thainks for your answer!

How can caching influence the kernel2 ‘s time when the kernel1’ s an kernel2 's data are different?
How can the kernel2 's data be cached before the kernel2 's execution?
do you mean tha way i manage the host and device memory?
do you mean incorrect measurement of execution time? No, this is not possible.
i don’t think there is a code defect since the results of both of the kernels are all correct.
i can not understand this. You would help me a lot if you could explain it.

Sorry for the number of question, but i am new to cuda and this is very weird to me.

Robert_Crovella · January 11, 2018, 11:21pm

Data can be cached by cudaMemcpy, or anything that touches memory. If the sequence of cudaMemcpy operations changes as a result of the kernel reordering, that could affect things.
I mean managed memory. If you don’t know what managed memory (try googling “CUDA managed memory”, then take any of the first 5 hits, rather than making me explain every single term), you probably arent using it.
OK
OK
CUDA has various kinds of startup time that must be incurred in any program. CUDA lazy initialization allows this to be smeared over the beginning of your program.

If there’s something I haven’t explained, you might want to try google first.

evabasis98 · January 11, 2018, 11:30pm

ok. thainks!

Topic		Replies	Views
Speed up due to a kernel launch ? CUDA Programming and Performance	3	1191	December 26, 2009
when a application runs, the first execution of a kernel will spend a longer time than the second. CUDA Programming and Performance	2	489	June 14, 2016
Problem with CudaMemcpy CUDA Programming and Performance	1	693	March 18, 2014
Same kernel and data exhibits different performance CUDA Programming and Performance	3	472	December 3, 2021
How different kernels affect the performance Performance issues CUDA Programming and Performance	3	4428	September 18, 2007
Inconsistent CUDA Kernel Execution Times in Sequential Execution CUDA Programming and Performance cuda	6	185	June 11, 2024
Timing Issue CUDA Programming and Performance	1	838	May 31, 2010
Why is the Kernel faster when my matrices are not initialized CUDA Programming and Performance	2	737	December 18, 2017
What could be possible reasons for affecting the kernel launch overhead for fast small kernels? CUDA Programming and Performance	5	24	October 22, 2024
Second kernel run is faster than first run CUDA Programming and Performance	2	1858	September 27, 2016

Reduction of kernel's execution time that does not make sense

Related topics