What are preemption-restore events?

kyle.li · March 22, 2022, 6:34am

Hi!
While reading the description of the NCU metric sm__ctas_launched_total, I came across the term “preemption-restore events”, which I don’t understand. Could you explain what it means?

felix_dt · March 22, 2022, 8:12am

The compute preemption feature provides a way to prevent long-running kernels from monopolizing the GPU, at the cost of the context-switch overhead that compute preemption introduces. The preemption-restore events referred to in the total metric correspond to this overhead. You can also find more information on this feature here.

kyle.li · March 24, 2022, 2:12am

If I want an example that distinguishes the NCU metrics sm__ctas_launched_total and sm__ctas_launched, what kind of demo would you recommend? I suppose a simple kernel would not show the difference.

kyle.li · March 26, 2022, 3:09am

Hi, felix_dt! Over the past few days I have been trying to work out which situations count as preemption events, but I still cannot find a concrete case. Could you give me a more detailed sample that would help me understand these three metrics?

felix_dt · March 30, 2022, 8:32am

Nsight Compute increases the compute preemption timeout to multiple seconds while profiling. This reduces the number of times a profiled kernel is preempted and makes the per-kernel results more precise. To still see preemption events, you could use two applications: one with an (infinitely) long-running kernel, and a second, profiled one with a kernel that runs for at least ~5 seconds. You should then see that sm__ctas_launched_total.sum is potentially larger than sm__ctas_launched.sum:

$ infiniteKernel &
$ ncu --metrics sm__ctas_launched.sum,sm__ctas_launched_total.sum,gpu__time_duration.sum ./waitKernel
[...]
---------------------------------------------------------------------- -----
gpu__time_duration.sum                 second                           4.39
sm__ctas_launched.sum                  block                            1
sm__ctas_launched_total.sum            block                            2

---------------------------------------------------------------------- -----
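
The sources of infiniteKernel and waitKernel were not posted, so the following is only a minimal sketch of what the two applications might look like, not the original code. It assumes a single hypothetical file spin.cu built twice: once as is for waitKernel, and once with -DINFINITE for infiniteKernel. The kernel name spinKernel, the INFINITE macro, and the cycle estimate based on cudaDevAttrClockRate are all assumptions made for illustration.

```cuda
// spin.cu: hypothetical stand-in for both binaries used above (not the original code).
//   nvcc -o waitKernel spin.cu
//   nvcc -DINFINITE -o infiniteKernel spin.cu
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#ifdef INFINITE
static const double kDefaultSeconds = -1.0;  // negative: spin until the process is killed
#else
static const double kDefaultSeconds = 5.0;   // the post suggests at least ~5 seconds
#endif

// Busy-wait for roughly `cycles` SM clock cycles; a negative value never returns.
__global__ void spinKernel(long long cycles)
{
    const long long start = clock64();
    for (;;) {
        // clock64() is read on every iteration, so the loop is not optimized away.
        const long long elapsed = clock64() - start;
        if (cycles >= 0 && elapsed >= cycles) {
            break;
        }
    }
}

int main(int argc, char **argv)
{
    // An optional first argument overrides the default duration in seconds.
    const double seconds = (argc > 1) ? atof(argv[1]) : kDefaultSeconds;

    // Peak SM clock in kHz; the real clock may be lower, so the duration is approximate.
    int clockKHz = 0;
    cudaError_t err = cudaDeviceGetAttribute(&clockKHz, cudaDevAttrClockRate, 0);
    if (err != cudaSuccess) {
        fprintf(stderr, "cudaDeviceGetAttribute: %s\n", cudaGetErrorString(err));
        return 1;
    }

    const long long cycles =
        (seconds < 0.0) ? -1LL : (long long)(seconds * clockKHz * 1000.0);

    // One block with one thread, so sm__ctas_launched.sum is 1 for this kernel.
    spinKernel<<<1, 1>>>(cycles);
    err = cudaGetLastError();
    if (err == cudaSuccess) {
        err = cudaDeviceSynchronize();
    }
    if (err != cudaSuccess) {
        fprintf(stderr, "spinKernel: %s\n", cudaGetErrorString(err));
        return 1;
    }
    return 0;
}
```

With a single block launched, any value of sm__ctas_launched_total.sum above 1 would then come from preemption-restore events. Whether preemption actually occurs during a given run depends on the GPU, the driver, and timing, so the counts may differ from the output shown above. Remember to kill the infiniteKernel process afterwards.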
