Phantom detections in cluster modes != 0

foreverneilyoung · February 13, 2021, 3:04pm

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU)
Jetson Nano
• DeepStream Version
5.0.1
• JetPack Version (valid for Jetson only)
4.5
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type( questions, new requirements, bugs)
• How to reproduce the issue ? (This is for bugs. Including which sample app is using, the configuration files content, the command line used and other details for reproducing)

SEE BELOW

• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)

I’m noticing phantom detections in all cluster-modes != 0 at high pre-cluster-threshold of 0.5.

I took a 50 seconds snip off of this video Morning Meeting and Warm up - Sysco Eastern WI - Capstone Logistics - YouTube and did run inference on it using the deepstream-test3.py sample

Here is the 50 s snip off for download

Leave deepstream-test3.py unchanged but apply these changes to dstest3_pgie_config.txt. IMHO all “normal” and legit changes:

diff --git a/apps/deepstream-test3/dstest3_pgie_config.txt b/apps/deepstream-test3/dstest3_pgie_config.txt
index a6a797f..16e3e65 100644
--- a/apps/deepstream-test3/dstest3_pgie_config.txt
+++ b/apps/deepstream-test3/dstest3_pgie_config.txt
@@ -59,23 +59,24 @@
 
 [property]
 gpu-id=0
+workspace-size=600
 net-scale-factor=0.0039215697906911373
-model-file=../../../../samples/models/Primary_Detector/resnet10.caffemodel
-proto-file=../../../../samples/models/Primary_Detector/resnet10.prototxt
-model-engine-file=../../../../samples/models/Primary_Detector/resnet10.caffemodel_b1_gpu0_int8.engine
-labelfile-path=../../../../samples/models/Primary_Detector/labels.txt
-int8-calib-file=../../../../samples/models/Primary_Detector/cal_trt.bin
+model-file=../../../../samples/models/Primary_Detector_Nano/resnet10.caffemodel
+proto-file=../../../../samples/models/Primary_Detector_Nano/resnet10.prototxt
+model-engine-file=../../../../samples/models/Primary_Detector_Nano/resnet10.caffemodel_b3_gpu0_fp16.engine
+labelfile-path=../../../../samples/models/Primary_Detector_Nano/labels.txt
 force-implicit-batch-dim=1
-batch-size=1
+batch-size=3
 process-mode=1
 model-color-format=0
-network-mode=1
+network-mode=2
 num-detected-classes=4
 interval=0
 gie-unique-id=1
 output-blob-names=conv2d_bbox;conv2d_cov/Sigmoid
+cluster-mode=3
 
 [class-attrs-all]
-pre-cluster-threshold=0.2
+pre-cluster-threshold=0.5
 eps=0.2
 group-threshold=1

With this setup you will notice a single shot person detection at frame 785 after about 10 seconds of no detection. With the same pre-cluster-threshold but cluster-mode=0 there is no such a phantom.

You might say, who gives a f… on such a single event. I would like to point out, that my use case is to detect possible collisions beforehand. As you can see, there is a long drive along an aisle in a warehouse. The algorithm shoots out of the sudden with a detection rectangle, which renders to an approximate distance of 1 m to the “person” ahead (aka ghost in this case). This would definitely have to result in a full-brake event. For nothing.

I need the “confidence” value, otherwise I would be OK to go with cluster-mode=0

bcao · February 22, 2021, 3:41am

Hey, If you set cluster-mode=3 that means DBSCAN algorithm, but eps and group-threshold are for cluster-mode=0, could you check Gst-nvinfer — DeepStream 6.1.1 Release documentation

foreverneilyoung · February 22, 2021, 8:33am

I said I have this fake detection phenomenon in ALL cluster modes != 0. The eps and group-threshold values are just something taken from one of your templates…

The real problem is, that I have no clue, what exactly all these cluster modes mean. And it is not documented.

Other than that I would expect, that entries, irrelevant for a cluster mode, are gracefully ignored. BTW: At leasts eps seems to be relevant for 3 too.

Did you try my setup?

bcao · March 2, 2021, 9:52am

Yes, it’s documented, could you check Gst-nvinfer — DeepStream 6.3 Release documentation for the corresponding config items for all the cluster-mode.

Please share the detailed repro steps if the issue still not be resolved, I will debug locally.

foreverneilyoung · March 2, 2021, 10:16am

Well, yes and no. The doc provides some buzz words w/o any context. So what exactly is behind this:

Integer 0: OpenCV groupRectangles() 1: DBSCAN 2: Non Maximum Suppression 3: DBSCAN + NMS Hybrid 4: No clustering

What is DBSCAN, NMS, Hybrid?!

Please share the detailed repro steps if the issue still not be resolved, I will debug locally.

All what you need is in my initial post. What else do you need?

bcao · March 3, 2021, 1:17am

DBSCAN(density-based spatial clustering of applications with noise ) , NMS(non max suppression) and hybrid are different types of clustering algorithms. The source code is available in /opt/nvidia/deepstream/deepstream/sources/libs/nvdsinfer/nvdsinfer_context_impl_output_parsing.cpp with comments. Hybrid option makes use of both dbscan and nms algorithms in a two step approach.

We will check internally to see if we can provide more info in the doc.

Ok, I see.

foreverneilyoung · March 3, 2021, 7:56am

Thanks for the extra explanation. I hope you will be able to reproduce these “phantom detections”

bcao · March 4, 2021, 6:58am

I can repro your issue, actully when set cluster-mode=0, there are still some false detection.
For your input video, I think you can try with our peoplenet, you only need to detect the people, right?

If yes, please try our people net, I tried it locally and it can work well. THe config file is under /opt/nvidia/deepstream/deepstream-5.1/samples/configs/tlt_pretrained_models/
labels_peoplenet.txt and you can get the model via wget https://api.ngc.nvidia.com/v2/models/nvidia/tlt_peoplenet/versions/pruned_v2.0/files/resnet34_peoplenet_pruned.etlt -O resnet34_peoplenet_pruned.etlt

foreverneilyoung · March 4, 2021, 7:24am

Cool. Will give it a try. Thanks. I suppose you currently don’t have an explanation for the glitches?

bcao · March 4, 2021, 7:40am

I think it should be caused by the model itself, the resnet10.caffemodel is used to detect Car, Bicycle, Person, Roadsign, it’s more suitable for a traffic use case, I’m not sure if the false detection bbox is from a wrong person or maybe just a wrong Roadsign, you can print the classid to confirm it.
But in my opinion, it’s more suitable to use people net which can detect people, face and bag if you only want to detect people, you can filter out other class(bag, face) via install a probe in the nvinfer downstream plugin.

foreverneilyoung · March 4, 2021, 9:01am

I’m not sure if the false detection bbox is from a wrong person or maybe just a wrong Roadsign, you can print the classid to confirm it.

It was a person. I double checked that.

foreverneilyoung · March 26, 2021, 6:41pm

I checked your suggestion. Does not work for me.

I downloaded the model from the location you provided
I copied the labels.txt and the config and stitched together this configuration:

[property]
workspace-size=1000
gpu-id=0
net-scale-factor=0.003921569790691137
tlt-model-key=tlt_encode
tlt-encoded-model=/home/neil/dragonfly-safety/jetson-inference/models/primary-detector-nano/resnet34_peoplenet_pruned.etlt
labelfile-path=/home/neil/dragonfly-safety/jetson-inference/models/primary-detector-nano/labels_peoplenet.txt
input-dims=3;544;960;0
uff-input-blob-name=input_1
batch-size=3
process-mode=1
model-color-format=0
network-mode=2
num-detected-classes=3
cluster-mode=1
interval=0
gie-unique-id=1
output-blob-names=output_bbox/BiasAdd;output_cov/Sigmoid
[class-attrs-all]
pre-cluster-threshold=0.4
eps=0.7
minBoxes=1

It generally works, also with three cams, but

a) I need to set workspace-size=1000 because otherwise I’m catching INFO: [TRT]: Some tactics do not have sufficient workspace memory to run. Increasing workspace size may increase performance, please check verbose output.

b) The initial creation of the model file takes about a minute. Correct?

c) With the given input_dims config (which is deprecated, infer_dims is suggested, but this seems to follow another format pattern) I can only achieve an inference rate of about 2 fps, even if I’m using one camera only. I’m usually using 3 USB cams and can achieve 30 fps inference rate per camera with the resnet10 model.

d) With input_dims like input-dims=3;244;244;0 I’m achieving about 10 fps. Not enough for me.

e) I finally thought to have found out, what infer_dims is, which I set to 3;640;480, because my input is 640x480. But the inference rate is still very, very low and the latency is exorbitant high compared to resnet10.

f) Most of the items around me are detected as “bag”, even part of my clothes. That doesn’t very much improve my situation.

=> Not that good

foreverneilyoung · March 26, 2021, 6:44pm

TO BE ADDED: Results are pretty good (not superior) with

  infer-dims=3;244;244

Any explanations for that?

foreverneilyoung · March 26, 2021, 8:27pm

I’m now at 24 fps for all three cams with the 244,244 setting above, but I’m not sure what this setting means.

The algorithm also has a lot of phantom detections, mostly bags. :/

Topic		Replies	Views
Faces and bags not detected on peoplenet DeepStream SDK	6	188	May 31, 2024
Implementing Real-Time, Multi-Camera Pipelines with NVIDIA Jetson Technical Blog	7	1516	July 9, 2024
Object Clustering using DBSCAN in Deepstream DeepStream SDK	7	590	August 10, 2022
resnet10.caffemodel_b8_fp16.engine is optimized for DeepStream SDK	10	1389	October 12, 2021
Negative confidence in DP 6.1.1 DeepStream SDK	7	463	September 22, 2023
Hello AI World - new object detection training and video interfaces Jetson Nano	29	4494	April 20, 2021
Some question about Deep stream 5 DeepStream SDK	42	1784	October 12, 2021
Deepstream and JetPack 3.3 DeepStream SDK	33	5015	January 29, 2019
FaceDetectIR in Deepstream 6.1, poor performance on IR but not on color? TAO Toolkit deepstream , deepstream61	4	625	October 18, 2022
DeepStream Pipeline Camera Sync Issue After Prolonged Running DeepStream SDK deepstream	19	40	March 24, 2025

Phantom detections in cluster modes != 0

Related topics