We’re developing a real-time restaurant monitoring system using a Jetson Orin NX 16GB that processes 8 RTSP feeds from Reolink cameras. Our application extracts critical metadata including:
-
License plate detection and recognition
-
People counting and tracking
-
Zone-based time tracking (customer dwell time)
-
Parking space occupancy detection
The processed data needs to be displayed on a real-time dashboard with less than 2-second latency for operational decision-making.
Setup Details:
-
Hardware Platform: Jetson Orin NX 16GB
-
DeepStream SDK: 7.1
-
JetPack Version: 6.2.1+b38
-
TensorRT Version: 10.7.0.23-1+cuda12.6
-
Operating System: Ubuntu 22.04 (via JetPack 6.2)
-
Cameras: 8x Reolink cameras via RTSP
-
Model Format: FP16 precision
-
Approaches Tried:
-
Docker container with TensorRT optimization
-
Native DeepStream pipeline implementation
-
Problem
We’re experiencing approximately 15-20 second latency between the source RTSP feeds and the processed output, which makes real-time monitoring impossible. This delay is consistent across both our Docker/TensorRT and DeepStream implementations. Also sometime the script stop
Please check my attached zip folder for understanding the architecture. “deepstream_app.py“ is the main file.
deepstream_setup.zip (12.8 KB)
Requirements
-
Maximum acceptable latency: 2 seconds end-to-end
-
System runs continuously (15+ hours daily)
-
All 8 camera feeds must be processed simultaneously
Questions
-
What are the recommended DeepStream configurations for minimizing RTSP latency with multiple streams?
-
Should we consider different power modes (MAXN vs 15W) for better real-time performance?
-
Is the Orin NX 16GB capable of handling 8 simultaneous feeds with our latency requirements, or should we consider distributing the load?
-
Do we have to change our camera which will be directly connected to the jetson device? any suggestions for the camera?
I just need suggestion from expert to improve the architecture, so that in dashboard we can see the realtime data in less than 2 seconds.