Hello, I am using NVIDIA DeepStream for instance segmentation. I want to run a secondary detector on the mask produced by the instance segmentation, so that it detects objects only inside the mask. I have configured both the primary detector and the secondary detector, but the secondary detector does not restrict itself to the mask; it detects objects anywhere inside the detection box of the entire segmented object.
My pipeline is:
rtmpsrc -> flvdemux -> h264parse -> nvv4l2decoder -> streammux -> nvinfer -> queue -> nvinfer -> nvvidconv -> nvosd -> nvv4l2h264enc -> h264parse -> flvmux -> rtmpsink
nx@xavier:~$ sudo deepstream-app -v
deepstream-app version 6.3.0
DeepStreamSDK 6.3.0
Please provide complete information as applicable to your setup.
• Hardware Platform (Jetson / GPU)
• DeepStream Version
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type (questions, new requirements, bugs)
• How to reproduce the issue? (This is for bugs. Include which sample app is used, the configuration file contents, the command line used, and other details for reproducing.)
• Requirement details (This is for new requirements. Include the module name, i.e. for which plugin or sample application, and the function description.)
For the instance segmentation application, please refer to the peopleSegNet sample. For using a detection model as the secondary detector, please refer to the back-to-back-detectors sample.
What are the detailed inputs and outputs of the two models? If the first model outputs bboxes, you can ignore the mask, because the second model only needs the detection box of the entire segmented object.
I have already linked the first model to the second model. What I want now is to feed the mask area from the first model into my second detection model, instead of feeding it the detection box.
About “but detects objects in the detection box of the entire segmented object”: is the detection box not a rectangle?
nvinfer only supports rectangular areas; polygons and irregular areas such as masks are not supported. You need to use the coordinates of the rectangle that encloses the mask.
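To illustrate what "the rectangle that encloses the mask" means, here is a minimal sketch in plain Python. It is not DeepStream code: the function name `mask_to_bbox`, the 0.5 threshold, and the mask layout (a list of rows of per-pixel scores) are all assumptions made for the example, and the real mask format in the DeepStream metadata may differ.

```python
def mask_to_bbox(mask, threshold=0.5):
    """Return (left, top, width, height) of the tightest rectangle that
    encloses every mask cell above `threshold`, or None for an empty mask.

    `mask` is assumed to be a list of rows, each a list of float scores.
    """
    # Rows that contain at least one foreground cell.
    rows = [y for y, row in enumerate(mask) if any(v > threshold for v in row)]
    if not rows:
        return None
    # Column index of every foreground cell.
    cols = [x for row in mask for x, v in enumerate(row) if v > threshold]
    top, bottom = rows[0], rows[-1]
    left, right = min(cols), max(cols)
    return (left, top, right - left + 1, bottom - top + 1)


# Hypothetical 4x5 mask, for illustration only.
mask = [
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.1, 0.9, 0.0, 0.0],
    [0.0, 0.8, 0.9, 0.7, 0.0],
    [0.0, 0.0, 0.6, 0.0, 0.0],
]
print(mask_to_bbox(mask))  # (1, 1, 3, 3)
```

The result is still a rectangle, so pixels inside the box but outside the mask are part of what the secondary detector sees.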
Please refer to point 2 in my second comment. Taking peopleSegNet as an example, this model outputs both bboxes and masks. The sgie detection model only needs the bboxes; nvosd will draw the mask.