Why is the fps not crossing 35 even though free GPU space is available?

sk.ahmed401 · November 21, 2020, 6:20am

With a single stream: GPU usage 1158 MB/5000 MB, fps 35.
With 2 streams: GPU usage 1580 MB/5000 MB, fps 16 each.

What am I doing wrong? How can I utilize the GPU to its maximum?

• Hardware Platform (GPU)
• DeepStream Version 5.0.1
• TensorRT Version 7.0
• NVIDIA GPU Driver Version (valid for GPU only) 440.10

Fiona.Chen · November 23, 2020, 3:23am

The performance of the whole pipeline is influenced by the actual usage of every element in it: the characteristics of the model, how the model is used, the performance of the video sources, …

So it is hard for us to tell you anything without any details of your pipeline.

sk.ahmed401 · November 23, 2020, 3:38am

Let me know what details you are expecting from me.

I have used the detectnet_v2 jupyter notebook to train a resnet18 based object detection model.
The input image size is 1920X1072.

For DeepStream, I followed the configurations from the face_mask detection repo.

Fiona.Chen · November 23, 2020, 4:07am

Please share the whole application, the video you use, the model you use, the configuration files, and the platform information (HW and SW).

Fiona.Chen · November 23, 2020, 4:08am

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU)
• DeepStream Version
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type (questions, new requirements, bugs)
• How to reproduce the issue? (This is for bugs. Include which sample app is used, the configuration file contents, the command line used, and other details for reproducing.)
• Requirement details (This is for new requirements. Include the module name, i.e. which plugin or sample application it is for, and the function description.)

sk.ahmed401 · November 23, 2020, 4:19am

• Hardware Platform (GPU) GTX 1660
• DeepStream Version 5.0.1
• TensorRT Version 7.0.0
• NVIDIA GPU Driver Version (valid for GPU only) 440.100
Config files are attached here:
source1_video_barnet_gpu.txt (2.6 KB), config_infer_primary_barnet_gpu.txt (1.6 KB)
TLT config files:
tlt_configs.zip (103.4 KB)
I can’t provide the trained model and the video on an open platform.

Fiona.Chen · November 23, 2020, 4:39am

Please do not choose “live-source=1” for testing local video files.
For performance testing, please use fakesink instead of eglsink.

sk.ahmed401 · November 23, 2020, 6:34am

I made the changes that you have suggested. The fps improved from 28 to 20.

[application]
enable-perf-measurement=1
perf-measurement-interval-sec=1

[tiled-display]
enable=1
rows=1
columns=2
width=1280 #640
height=960 #480
gpu-id=0


[source0]
enable=1
#Type - 1=CameraV4L2 2=URI 3=MultiURI
type=3
num-sources=1
uri=file:/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video.mp4
gpu-id=0

[source1]
enable=1
#Type - 1=CameraV4L2 2=URI 3=MultiURI
type=3
num-sources=1
uri=file:/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video.mp4
gpu-id=0

[streammux]
gpu-id=0
batch-size=2
batched-push-timeout=40000
## Set muxer output width and height
width=1920
height=1072
buffer-pool-size=400
#nvbuf-memory-type: 0-4
nvbuf-memory-type=1
#live-source: 1-live 0-default
#live-source=0

[sink0]
enable=1
#Type - 1=FakeSink 2=EglSink 3=File
type=1
sync=0
source-id=0
gpu-id=0
container=2
codec=1
bitrate=2000000
output-file=/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video_output.MP4

[sink1]
enable=1
#Type - 1=FakeSink 2=EglSink 3=File
type=4
sync=1
source-id=1
gpu-id=0
container=2
codec=1
bitrate=2000000
#output-file=/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video_output.MP4
#rtsp-port=8554
#udp-port=5400

Fiona.Chen · November 23, 2020, 6:41am

Since you have enabled [tiled-display], please disable [sink1].
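
The advice given so far in this thread (no live-source=1 for local files, FakeSink instead of EglSink for performance tests, and only one enabled sink when [tiled-display] is on) can be checked mechanically. The following is a minimal Python sketch, not part of DeepStream. The function name perf_test_warnings and the sample config are hypothetical, and it assumes the deepstream-app config file is INI-like enough for Python's configparser to read.

```python
import configparser


def perf_test_warnings(config_text):
    """Return warnings for settings this thread advises against when benchmarking."""
    # strict=False tolerates repeated sections; interpolation=None keeps '%' literal.
    parser = configparser.ConfigParser(strict=False, interpolation=None)
    parser.read_string(config_text)

    def value(section, key):
        # Missing sections/keys yield ""; trailing comments ("width=1280 #640") are dropped.
        raw = parser.get(section, key, fallback="")
        return raw.split("#", 1)[0].strip()

    warnings = []

    if value("streammux", "live-source") == "1":
        warnings.append("[streammux] live-source=1: avoid this for local video files")

    enabled_sinks = [
        name for name in parser.sections()
        if name.startswith("sink") and value(name, "enable") == "1"
    ]
    for name in enabled_sinks:
        if value(name, "type") == "2":
            warnings.append(f"[{name}] type=2 (EglSink): use type=1 (FakeSink) for performance tests")

    if value("tiled-display", "enable") == "1" and len(enabled_sinks) > 1:
        warnings.append("[tiled-display] is enabled: keep only one sink enabled")

    return warnings


# Hypothetical config fragment for illustration only (not the poster's file).
SAMPLE_CONFIG = """
[tiled-display]
enable=1

[streammux]
live-source=1

[sink0]
enable=1
type=2

[sink1]
enable=1
type=1
"""

if __name__ == "__main__":
    for warning in perf_test_warnings(SAMPLE_CONFIG):
        print(warning)
```

The check only covers the three points raised in this thread; it does not validate the rest of the config.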

sk.ahmed401 · November 23, 2020, 6:57am

Nothing improved

[application]
enable-perf-measurement=1
perf-measurement-interval-sec=1

[tiled-display]
enable=1
rows=1
columns=2
width=1280 #640
height=960 #480
gpu-id=0

[sink0]
enable=1
#Type - 1=FakeSink 2=EglSink 3=File
type=1
sync=1
source-id=0
gpu-id=0
container=2
codec=1
bitrate=2000000
output-file=/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video_output.MP4

[sink1]
enable=0
#Type - 1=FakeSink 2=EglSink 3=File
type=1
sync=1
source-id=1
gpu-id=0
container=2
codec=1
bitrate=2000000
#output-file=/opt/nvidia/deepstream/deepstream-5.0/samples/streams/barcode_video_output.MP4
#rtsp-port=8554

Fiona.Chen · November 23, 2020, 8:19am

What do you want to improve? The FPS? What are the current GPU and CPU loads?

sk.ahmed401 · November 23, 2020, 8:22am

The current GPU loading is 1158MB/5400MB. I want to know how I can use all of my GPU when processing a video. In other words, I would like to see the max fps of my model using DeepStream.

Fiona.Chen · November 23, 2020, 9:58am

This is only GPU memory usage, not GPU loading.

You can get the monitoring data with “nvidia-smi dmon”
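
To read GPU loading rather than memory usage from that output, the "sm" column (and "dec" for the video decoder) is the one to watch. Below is a small Python sketch that averages a column from captured dmon text. The function name mean_utilization is hypothetical, the sample rows are made-up numbers and not measurements from this thread, and the header layout is assumed from typical dmon output (the exact column set varies by driver version, which is why the column is located by name).

```python
def mean_utilization(dmon_text, column="sm"):
    """Average one column of `nvidia-smi dmon` text output.

    The column is found by name in the '# gpu ...' header row.
    Returns None when no numeric samples are found.
    """
    header = None
    values = []
    for line in dmon_text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith("#"):
            names = stripped.lstrip("# ").split()
            # The names row starts with 'gpu'; the units row ('Idx ...') is skipped.
            if names and names[0] == "gpu":
                header = names
            continue
        if header is None:
            continue
        fields = stripped.split()
        index = header.index(column)  # raises ValueError for an unknown column
        if index < len(fields) and fields[index] != "-":
            values.append(float(fields[index]))
    return sum(values) / len(values) if values else None


# Illustrative sample only: made-up numbers in the assumed dmon layout.
SAMPLE_DMON = """\
# gpu   pwr gtemp mtemp    sm   mem   enc   dec  mclk  pclk
# Idx     W     C     C     %     %     %     %   MHz   MHz
    0    95    70     -    99    45     0    12  4001  1830
    0    96    71     -    98    44     0    12  4001  1830
"""

if __name__ == "__main__":
    print("mean sm %:", mean_utilization(SAMPLE_DMON, "sm"))
    print("mean dec %:", mean_utilization(SAMPLE_DMON, "dec"))
```

An "sm" average close to 100 while the pipeline runs would indicate the GPU compute units are saturated, regardless of how much memory is still free.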

sk.ahmed401 · November 23, 2020, 1:21pm

I ran with “nvidia-smi dmon”

Fiona.Chen · November 24, 2020, 12:41am

The GPU is already fully occupied. The GTX 1660 may not be powerful enough to handle multiple streams (higher FPS) with your model.
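
The numbers from the first post fit this reading: one stream at 35 fps and two streams at 16 fps each give nearly the same total frame throughput. A short Python sketch of that arithmetic follows; the helper name aggregate_fps is hypothetical, and treating a flat total as a sign of saturation is an interpretation rather than a measurement.

```python
def aggregate_fps(per_stream_fps, num_streams):
    """Total frames per second processed across all streams."""
    return per_stream_fps * num_streams


# Figures reported in the first post of this thread.
single_stream_total = aggregate_fps(35, 1)
two_stream_total = aggregate_fps(16, 2)

# A ratio near 1.0 means adding a stream did not add throughput.
ratio = two_stream_total / single_stream_total

if __name__ == "__main__":
    print(f"1 stream: {single_stream_total} fps total")
    print(f"2 streams: {two_stream_total} fps total")
    print(f"ratio: {ratio:.2f}")
```

If the total stayed flat as more streams were added, free GPU memory would not help; the limit would lie in compute or another pipeline element.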

sk.ahmed401 · November 24, 2020, 1:54am

Thanks for the response, @Fiona.Chen. I will check again.
