Multithreaded tensorRT performance drops dramatically

yes, create a topic: Nsys cannot collect cuda information on Drive OS 5.1