Please provide complete information as applicable to your setup.
• Hardware Platform (Jetson / GPU): dGPU
• DeepStream Version: 8.0
• JetPack Version (valid for Jetson only)
• TensorRT Version: 10.9.0.34
• NVIDIA GPU Driver Version (valid for GPU only): L20
• Issue Type( questions, new requirements, bugs): questions
Hello NVIDIA team, I wanna ask about this problem I have when I am using the deepstream-server sample application for my pipeline. I am very new to all of this so your guidance will help a lot.
Currently, I am trying to create a pipeline where:
- I can add/delete models dynamically without restarting the server,
- i can add/delete streams without stopping and restarting the server
Using the files here: https://drive.google.com/drive/folders/1llRT7l-UgnPArC1AoFgRmEJhL9M8CEap?usp=sharing
I basically do:
Restart docker compose
docker compose down && docker compose up -d
Terminal 1: Triton Server
cd /root/saas-triton-deepstream
docker compose exec deepstream bash
cd /workspace/model_repository
tritonserver --model-repository=/workspace/model_repository \
--model-control-mode=poll \
--repository-poll-secs 30 \
--pinned-memory-pool-byte-size 1073741824 \
--cuda-memory-pool-byte-size 0:4294967296
Terminal 3: Run Stream
source /root/samurai-copilot-saas/testenv/bin/activate
python /root/saas-triton-deepstream/scripts/stream_publisher.py /root/saas-triton-deepstream/test-media/sample_1080p_h264_20fps.mp4 -n 32
Terminal 2: DeepStream with REST API
cd /root/saas-triton-deepstream
docker compose exec deepstream bash
export GST_PLUGIN_PATH=/opt/nvidia/deepstream/deepstream-8.0/lib/gst-plugins:$GST_PLUGIN_PATH
export LD_LIBRARY_PATH=/workspace/lib:/opt/nvidia/deepstream/deepstream-8.0/lib:$LD_LIBRARY_PATH
/workspace/deepstream-app-server/start_deepstream_server.sh
I test this and add the streams one-b-one to check for when the bottleneck occurs. And It seems when the streams reach the 19 counts it starts to go bad:
example with 16 streams:
**PERF : FPS_0 (0.00) FPS_1 (19.90) FPS_2 (19.90) FPS_3 (19.89) FPS_4 (19.88) FPS_5 (19.88) FPS_6 (19.89) FPS_7 (19.89) FPS_8 (19.88) FPS_9 (19.84) FPS_10 (19.86) FPS_11 (19.88) FPS_12 (19.88) FPS_13 (19.88) FPS_14 (19.82) FPS_15 (19.82)
results using 21 streams:
**PERF : FPS_0 (0.00) FPS_1 (19.60) FPS_2 (19.60) FPS_3 (19.60) FPS_4 (19.42) FPS_5 (19.42) FPS_6 (19.42) FPS_7 (19.40) FPS_8 (19.02) FPS_9 (18.95) FPS_10 (18.95) FPS_11 (18.87) FPS_12 (18.87) FPS_13 (18.87) FPS_14 (18.77) FPS_15 (18.77) FPS_16 (18.69) FPS_17 (15.51) FPS_18 (15.50) FPS_19 (16.62) FPS_20 (16.65)
nvidia-smi dmon output:
gpu pwr gtemp mtemp sm mem enc dec jpg ofa mclk pclk
Idx W C C % % % % % % MHz MHz
0 133 45 - 15 8 0 8 0 0 9001 2520
0 122 47 - 26 15 0 15 0 0 9001 2520
0 136 47 - 21 10 0 7 0 0 9001 2520
0 151 48 - 26 15 0 14 0 0 9001 2520
0 125 46 - 15 8 0 7 0 0 9001 2520
0 130 45 - 26 15 0 15 0 0 9001 2520
0 133 48 - 25 15 0 15 0 0 9001 2520
0 123 46 - 17 9 0 8 0 0 9001 2520
0 132 46 - 24 14 0 11 0 0 9001 2520
0 140 45 - 17 12 0 15 0 0 9001 2520
0 121 46 - 24 14 0 9 0 0 9001 2520
0 135 45 - 26 15 0 14 0 0 9001 2520
0 150 46 - 16 9 0 7 0 0 9001 2520
0 129 47 - 21 11 0 8 0 0 9001 2520
0 133 47 - 24 14 0 10 0 0 9001 2520
0 145 45 - 26 15 0 15 0 0 9001 2520
0 116 46 - 15 8 0 7 0 0 9001 2520
0 135 46 - 26 15 0 15 0 0 9001 2520
0 126 46 - 26 15 0 15 0 0 9001 2520
0 131 46 - 26 15 0 15 0 0 9001 2520
0 133 45 - 25 15 0 15 0 0 9001 2520
0 133 46 - 16 10 0 14 0 0 9001 2520
0 122 46 - 24 15 0 15 0 0 9001 2520
0 134 46 - 24 15 0 15 0 0 9001 2520
0 137 45 - 26 16 0 15 0 0 9001 2520
0 128 47 - 14 8 0 9 0 0 9001 2520
0 140 46 - 16 8 0 9 0 0 9001 2520
0 138 45 - 24 14 0 8 0 0 9001 2520
0 122 46 - 15 9 0 14 0 0 9001 2520
So, I also already tried the formula from:
DeepStream tune fps and also from several AI platforms.
So now I kinda stuck because I dunno what else should I do to enhance the GPU usage and performance overall. My goal is to be able to stream at least 20 streams at the same time with stable fps output.
FYI im using a ECS instance from Alibaba Cloud for this. Running it in my VSC with SSH because I dunno where else should i do it. Any help is greatly appreciated!!