How to adjust the parameters to accelerate YOLOv7 on DeepStream? I got 8 fps; I think something must have gone wrong when I did

383796283 · November 21, 2022, 5:46am

TESLA T4
Deepstream-6.1.1 docker environment
[NVIDIA-AI-IOT - yolo_deepstream]
chose yolov7.onnx → fp16.engine
batch-size = 16
detected sample_1080p_h264.mp4
model = yolov7.onnx
deepstream_test_config.txt (3.9 KB)

In the end, the maximum fps is 8.
(upload://1TdFWiMOdOAx5oNtJNPSizOBL24.txt) (3.1 KB)
command = deepstream-app -c deepstream_test_config.txt

mchi · November 21, 2022, 6:40am

Can you refer to GitHub - NVIDIA-AI-IOT/yolo_deepstream (YOLO model QAT and deployment with DeepStream and TensorRT)?

383796283 · November 21, 2022, 8:53am

Yes, I followed yolo_deepstream/deepstream_yolo at main · NVIDIA-AI-IOT/yolo_deepstream · GitHub.

mchi · November 21, 2022, 10:17am

But the sample in the project can get hundreds of fps. What’s the difference between your setup and the project's?

383796283 · November 21, 2022, 10:20am

I would also like to know …

mchi · November 21, 2022, 1:41pm

Do you mean you didn’t make any changes to the GitHub code?

383796283 · November 22, 2022, 12:44am

I did not change the code. I have uploaded the two config .txt files.

mchi · November 22, 2022, 2:17am

Can you use trtexec to run fp16.engine and check the QPS?

/usr/src/tensorrt/bin/trtexec --loadEngine=fp16.engine

Is total_fps = batch_number * batches/second?

Is 8 the total_fps or the batches/second?

383796283 · November 22, 2022, 2:25am

First, the QPS:
Throughput: 115.914 qps
Latency: min = 9.33789 ms, max = 14.3026 ms, mean = 9.67278 ms, median = 9.61938 ms, percentile(99%) = 10.2032 ms
[11/22/2022-02:23:05] [I] Enqueue Time: min = 0.906738 ms, max = 2.44945 ms, mean = 1.70993 ms, median = 1.74048 ms, percentile(99%) = 2.29266 ms
[11/22/2022-02:23:05] [I] H2D Latency: min = 0.406738 ms, max = 0.47998 ms, mean = 0.42492 ms, median = 0.422363 ms, percentile(99%) = 0.471191 ms
[11/22/2022-02:23:05] [I] GPU Compute Time: min = 8.26126 ms, max = 13.2157 ms, mean = 8.58368 ms, median = 8.53056 ms, percentile(99%) = 9.12329 ms
[11/22/2022-02:23:05] [I] D2H Latency: min = 0.653076 ms, max = 0.677811 ms, mean = 0.664174 ms, median = 0.663818 ms, percentile(99%) = 0.671997 ms
[11/22/2022-02:23:05] [I] Total Host Walltime: 3.02811 s
[11/22/2022-02:23:05] [I] Total GPU Compute Time: 3.01287 s
[11/22/2022-02:23:05] [W] * GPU compute time is unstable, with coefficient of variance = 3.40474%.
[11/22/2022-02:23:05] [W] If not already in use, locking GPU clock frequency or adding --useSpinWait may improve the stability.
[11/22/2022-02:23:05] [I] Explanations of the performance metrics are printed in the verbose logs.
Second, the total_fps = 8 * 16 = 128.
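
For anyone following along, the two numbers in this reply can be cross-checked with a few lines of shell. This is only a sketch: `qps_from_log` and `total_fps` are helper names made up here, and it assumes the 8 is a per-stream (or per-batch) rate and the 16 is the configured batch-size. Whether one trtexec query corresponds to one frame depends on the batch size the engine was run with, which the log does not show.

```shell
#!/bin/sh
# Hypothetical helpers, not part of trtexec or deepstream-app.

# Print the number after "Throughput:" from trtexec output read on stdin.
qps_from_log() {
  sed -n 's/.*Throughput: \([0-9.]*\) qps.*/\1/p' | head -n 1
}

# Multiply a per-stream (or per-batch) rate by the stream count (or batch size).
total_fps() {
  awk -v rate="$1" -v count="$2" 'BEGIN { printf "%g\n", rate * count }'
}

printf '%s\n' 'Throughput: 115.914 qps' | qps_from_log   # prints 115.914
total_fps 8 16                                           # prints 128
```

If one query really is one frame, the 128 fps from deepstream-app and the roughly 116 qps from trtexec are in the same range, which would suggest the pipeline is limited by inference speed rather than by a misconfiguration.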

mchi · November 23, 2022, 12:37am

Did you boost T4 clocks?

With the YoloV7.onnx from the project, I can get ~137 fps as below.

383796283 · November 23, 2022, 2:46am

Sorry, I do not know about T4 clocks. Can you explain more about them?

mchi · November 24, 2022, 7:50am

Boost the GPU frequency:

$ sudo nvidia-smi -pm ENABLED -i 0    # assuming the T4 GPU id is 0
$ sudo nvidia-smi -ac 5001,1590 -i 0  # set the memory clock and the graphics clock
$ nvidia-smi -q -d CLOCK -i 0         # confirm

Also, since the fps you got is lower than what I got in the screenshot above (115 vs. 137), CPU capability may be another possible reason besides the GPU clock.
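
The clock commands above can be wrapped in a small dry-run helper so the values are easy to change per GPU. A sketch only: `boost_clocks` and the `APPLY` variable are names invented here, and the clock pair 5001,1590 is the one given in this thread for the T4; other boards will support different values. The `-q -d SUPPORTED_CLOCKS` query lists the pairs a given GPU accepts.

```shell
#!/bin/sh
# Hypothetical helper: prints the nvidia-smi commands by default,
# and runs them through sudo only when APPLY=1 is set.
boost_clocks() {
  gpu_id="$1"; mem_mhz="$2"; gfx_mhz="$3"
  run="echo"
  if [ "${APPLY:-0}" = "1" ]; then run="sudo"; fi

  $run nvidia-smi -q -d SUPPORTED_CLOCKS -i "$gpu_id"    # list valid clock pairs
  $run nvidia-smi -pm ENABLED -i "$gpu_id"               # persistence mode on
  $run nvidia-smi -ac "$mem_mhz,$gfx_mhz" -i "$gpu_id"   # memory,graphics clocks
  $run nvidia-smi -q -d CLOCK -i "$gpu_id"               # confirm
}

# Dry run for GPU 0 with the T4 values from this thread.
boost_clocks 0 5001 1590
```

Running it with `APPLY=1` executes the same commands instead of printing them. `sudo nvidia-smi -rac -i 0` resets the application clocks afterwards.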

mchi · November 28, 2022, 3:24am

Hi @383796283
Any other questions about this?

383796283 · December 5, 2022, 3:00am

Nope, thanks for your help.

system · December 19, 2022, 1:21pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
