We use command "./trtexec --onnx=model.onnx --useDLACore=0 --best --allowGPUFallback
The test result prove that “all layers run on DLA”.
And we use tegrastats to monitor the GPU usage. There are still have GPU usage.
Could you check using nsys to confirm if GPU is used. It could be possible that TRT is inserting some reformat layers before DLA work. Also, signalling from GPU to DLA and back happens inside TRT when cuDLA hybrid mode is used. It could be interpreted as GPU usage by tegrastats.
Tegrastats is not recommend to check usage when GPU+DLA is invloved. We recommend to use nsys to get more clarify on the usage.