Delay with live inference

gabrielmstefanello058 · July 2, 2024, 4:00pm

• Hardware Platform (Jetson / GPU) Jetson Orin Nano Dev kit
• DeepStream Version 7.0
• JetPack Version (valid for Jetson only) 6
• TensorRT Version 8.6.2
• Cuda Version 12.2

• Issue Type( questions, new requirements, bugs)
I’m facing a problem with trying to do live-time inference with cameras videos. I am trying to use peoplesegnet to inference people segmentation and also trying to track them with nvmultiobjecttracker.so.
The main issue is that when I run my pipeline with the inference model and the nvTracker I get a huge delay, about 4s delayed between the live frame and the displayed frame by the pipeline. I’m trying to reduce this delay and I noticed that when I turn off Inference and Tracking the delay drops to a non-significant delay, my doubt is that does all this latency between frame and reality comes from those 2 sections from my pipeline?

my pipeline sequence is:
videocorverter → captureFilters → nvStreammux → Inference - > nvTracker → Tiler → OSD → converterSink → capsFilter → Sink

tracker_config.txt (7.9 KB)
(for any purpose)

kesong · July 3, 2024, 6:35am

Please run below command before run your pipeline:

Max power mode is enabled: $ sudo nvpmodel -m 0.
The GPU clocks are stepped to maximum: $ sudo jetson_clocks

Please share the log of below command line:
$ sudo tegrastats

gabrielmstefanello058 · July 3, 2024, 6:17pm

Heres the log of tegrastats while running the pipeline with inference and tracker
output_tegra.txt (33.8 KB)

kesong · July 3, 2024, 9:39pm

The GPU utilization is 99%. You can run nsys to check if the GPU utilization is reasonable.

gabrielmstefanello058 · July 4, 2024, 5:06pm

Yes, I know that. My question is about the delay between the frame and the real time frame, what is the correlation of gpu usage and the delay?

I am using right now a jetson orin nano and I can get 8fps. But if I run with a different gpu like a jetson agx orin I get 22fps but the delay stills over there

kesong · July 5, 2024, 8:51am

Seams GPU can’t process in real-time. It will cause delay. You need more powerful Jetson or optimize your model.

gabrielmstefanello058 · July 8, 2024, 5:19pm

I’m using peoplesegnet with int 8 quantization
(PeopleSegNet | NVIDIA NGC) because from what I have seen it looks like the best model considering accuracy and performance(framerate). Is there a better optimization that I could do or another model to try to see if gpu is the main source of the bottleneck from the pipeline?

kesong · July 9, 2024, 3:49pm

You can use nsys to check which module consumed the GPU.

yingliu · August 9, 2024, 6:18am

There is no update from you for a period, assuming this is not an issue anymore. Hence we are closing this topic. If need further support, please open a new one. Thanks

system · August 23, 2024, 6:19am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Delay in one of the multistream tiler outputs DeepStream SDK rtsp	11	1643	October 12, 2021
Using a Gstreamer Tee element in inference pipeline DeepStream SDK	9	4051	October 12, 2021
RTSP live source. Discard past frames from buffer and go for newest one DeepStream SDK	11	546	October 12, 2021
Output display lags which is far from real time DeepStream SDK	7	745	January 17, 2023
Nvidia jetson detectnet increasing latency Jetson Nano jetson-inference , ai	9	1649	October 15, 2021
How to maximize inferences/sec in a deepstream pipeline DeepStream SDK	13	1015	October 12, 2021
Remove gstreamer pipeline buffering DeepStream SDK gstreamer , deepstream	15	1373	October 2, 2023
Peoplnet Performace in Deepstream Pipeline DeepStream SDK python , inception	4	559	June 6, 2023
RTSP stream delay DeepStream SDK	8	1757	October 12, 2021
Peoplenet performance on Jetson DeepStream SDK	7	1263	October 12, 2021

Delay with live inference

Related topics