ioctl blocks for tens of milliseconds when CUDA APIs are called only occasionally

When I call a CUDA API (for example, cudaEventRecord or cudaMemcpyAsync) only once every few tens of seconds, the first ioctl issued by that call blocks for tens of milliseconds.
What specifically causes ioctl to block here, and how can I avoid it?
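
The call pattern described above can be sketched roughly as follows. This is a hypothetical minimal reproducer, not the program that produced the trace: the 20-second idle interval, the iteration count, and the choice of cudaEventRecord as the timed call are all assumptions made for illustration. It times the first call after each idle period and an immediate second call, so the two latencies can be compared.

```cuda
#include <chrono>
#include <cstdio>
#include <cstdlib>
#include <thread>
#include <cuda_runtime.h>

// Abort with a message if a CUDA runtime call fails.
#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s failed: %s\n", #call,            \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

int main() {
    cudaStream_t stream;
    cudaEvent_t event;
    CHECK(cudaStreamCreate(&stream));
    CHECK(cudaEventCreate(&event));

    // Warm up so context creation is not included in the measurements.
    CHECK(cudaEventRecord(event, stream));
    CHECK(cudaStreamSynchronize(stream));

    const int kIterations = 5;                     // assumed value
    const auto kIdle = std::chrono::seconds(20);   // assumed idle interval

    for (int i = 0; i < kIterations; ++i) {
        // Leave the GPU idle, as in the scenario described in the post.
        std::this_thread::sleep_for(kIdle);

        // call 0 is the first call after idling; call 1 follows immediately.
        for (int call = 0; call < 2; ++call) {
            auto t0 = std::chrono::steady_clock::now();
            CHECK(cudaEventRecord(event, stream));
            auto t1 = std::chrono::steady_clock::now();
            double us =
                std::chrono::duration<double, std::micro>(t1 - t0).count();
            std::printf("iteration %d, call %d: %.1f us\n", i, call, us);
        }
        CHECK(cudaStreamSynchronize(stream));
    }

    CHECK(cudaEventDestroy(event));
    CHECK(cudaStreamDestroy(stream));
    return 0;
}
```

Built with nvcc and profiled with something like `nsys profile --trace=cuda,osrt ./a.out`, a sketch like this should show whether the ioctl under the first call after each idle period is the one that takes longer.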
device: 2080 Ti
CUDA: 11.7
driver: 515.43.04
first execution:

complete timeline:
report1.nsys-rep (2.0 MB)