NvV4L2 decoder (maybe) interrupts CUDA context

locki.janos.m · August 11, 2022, 10:58pm

Hello!

We have a system that analyzes 10 RTSP streams in parallel, and we have a performance issue with the GStreamer NvV4L2 decoder on JetPack 4.6.2.

In Nsight Systems 2021.5.4 we noticed that CUDA execution contexts are interrupted during decoding. So our first issue is that CUDA kernels possibly do **not** run in parallel with decoding tasks (but they do with encoding).

This is hard to see, though, because the NvV4L2 decoding tasks are missing from the Nsight Systems trace. The same happens with the gst-launch example command below. So this is the second issue.

CUDA_INJECTION64_PATH="/opt/nvidia/nsight_systems/libToolsInjection64.so" LD_PRELOAD="/opt/nvidia/nsight_systems/libToolsInjectionProxy64.so" QUADD_INJECTION_PROXY="cuDNN, cuBLAS, NvMedia" gst-launch-1.0 -v rtspsrc location="any_kind_of_rtsp" ! rtph264depay ! queue ! nvv4l2decoder enable-max-performance=1   ! nvvidconv !  nvv4l2h265enc maxperf-enable=1 bitrate=8000000 ! fakesink

I hope you can comment on this. We will try to reproduce our first issue in example code.

AastaLLL · August 12, 2022, 2:39am

Hi,

Does this only happen to RTSP input?
Could you check if a video or camera source also has the same issue?

Thanks.

locki.janos.m · August 15, 2022, 6:01am

Hello!

We checked: a video file source also has this issue.
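
For reference, a file-based variant of the gst-launch command from the first post might look like the sketch below. It is only an illustration under assumptions that are not confirmed in this thread: `sample_h264.mp4` is a placeholder file name, the `qtdemux ! h264parse` front end assumes an H.264 stream in an MP4 container, and the `DRY_RUN` switch is a hypothetical convenience that prints the command instead of running it. The decoder, converter, and encoder settings and the Nsight injection variables are copied from the RTSP command.

```shell
#!/bin/sh
# Sketch: same decode -> convert -> encode chain as the RTSP command,
# but fed from a local file. The file name is a placeholder.
VIDEO="${VIDEO:-sample_h264.mp4}"

# Assumes H.264 video inside an MP4 container (qtdemux + h264parse).
PIPELINE="filesrc location=$VIDEO ! qtdemux ! h264parse ! queue ! \
nvv4l2decoder enable-max-performance=1 ! nvvidconv ! \
nvv4l2h265enc maxperf-enable=1 bitrate=8000000 ! fakesink"

if [ "${DRY_RUN:-1}" = "1" ]; then
    # Default: only print the command so it can be inspected first.
    echo "gst-launch-1.0 -v $PIPELINE"
else
    # Run under the same Nsight Systems injection settings as the RTSP test.
    # $PIPELINE is left unquoted on purpose so it splits into arguments.
    CUDA_INJECTION64_PATH="/opt/nvidia/nsight_systems/libToolsInjection64.so" \
    LD_PRELOAD="/opt/nvidia/nsight_systems/libToolsInjectionProxy64.so" \
    QUADD_INJECTION_PROXY="cuDNN, cuBLAS, NvMedia" \
    gst-launch-1.0 -v $PIPELINE
fi
```

Running the script with `DRY_RUN=0 VIDEO=/path/to/file.mp4` would launch the pipeline; a different container or codec would need a different demuxer and parser.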

AastaLLL · August 16, 2022, 5:56am

Hi,

Thanks for the confirmation.

We want to reproduce this issue internally and check it further.
Could you share the detailed steps with us (with video source)?

Thanks.

There has been no update from you for a while, so we assume this is no longer an issue.
Hence we are closing this topic. If you need further support, please open a new one.
Thanks