While profiling with NSight, I saw repeated calls to cuCtxAttach and cuCtxDetach on the timeline.
There are about 20 sequential attach/detach pairs between the executions of different kernels (see the attached screenshot of the NSight timeline).
Can anybody explain what these calls do?
Is it possible to avoid them to improve performance?
I’m using the runtime API in my application, so I never call these driver API functions myself.
Thanks in advance!