In instance segmentation, what should be the data in the mask?

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU): GPU
• DeepStream Version: DeepStream 6.0
• TensorRT Version: TensorRT 8
• Issue Type (questions, new requirements, bugs): questions

When I try to deploy the yolov5s-seg instance segmentation model in DeepStream, I have a problem: the mask of the target is either not displayed or displayed incompletely.

The image size is 1920×1080, and the network input size is 640×640. Through debugging, we know that the mask is not rescaled to the original image size. Therefore, in post-processing, I restore the mask to the original image size; the data in the mask consists of values between 0 and 1.

In addition, the following test confirms that there is no problem with the incoming data.

    obj.mask_size = kImageH * kImageW * sizeof(float);
    obj.mask = new float[kImageH * kImageW];
    obj.mask_width = kImageW;
    obj.mask_height = kImageH;

    float* rawMask = reinterpret_cast<float *>(masks.at(idx).data);
    memcpy (obj.mask, rawMask, obj.mask_size);
    // test for memcpy: dump the copied mask as a grayscale image.
    // The Mat shape must match the allocated mask (kImageH x kImageW).
    cv::Mat tmp(kImageH, kImageW, CV_32FC1, (void*)obj.mask);
    cv::Mat uchar_mat;
    tmp.convertTo(uchar_mat, CV_8UC1, 255);
    cv::imwrite(std::to_string(idx)+".jpg", uchar_mat);

Below is the output:

[attached gray mask images]
But the final output mask is wrong; the result is as follows:

[attached result image]
So, I want to know what the data in the mask should be.

The mask pictures you posted show correct masks.

For example, this is the mask for the person with a backpack on his shoulder:

[attached image]

And this is the mask for the bus:

[attached image]
The gray mask pictures are created during post-processing and are only used to judge whether the mask is correct. But the mask in the video file generated after inference is not good.

What do you mean by “is not good”? Can you elaborate on your requirement or the issue you found?

This is the output video file. You can see that the result of the mask is not good at all.

I uploaded the video. You can see that the result of the mask is not good at all, but the gray mask I output in post-processing is fine.

You need to scale the output back according to the model preprocessing. How did you do the preprocessing scaling with your yolov5s-seg model? Is there a “keep-aspect-ratio” operation? What is the size of the mask output matrix (640x640 or another size)?

Is this the original video size, the nvstreammux size, or the final display size?

The image size is 1920×1080 and the network input is 640×640, with a “keep-aspect-ratio” operation. The mask output size should be 640×640, but I have rescaled the mask to 1920×1080.

Is this the original video size, the nvstreammux size, or the final display size?

    [streammux]
    gpu-id=0
    ## Boolean property to inform muxer that sources are live
    live-source=0
    batch-size=1
    ## Timeout in usec to wait after the first buffer is available,
    ## to push the batch even if the complete batch is not formed
    batched-push-timeout=40000
    ## Set muxer output width and height
    width=1920
    height=1080
    ## Enable to maintain aspect ratio wrt source and allow black borders;
    ## works along with the width, height properties
    enable-padding=0
    nvbuf-memory-type=0

What is the final display size? Where did you scale the mask?

Suppose your final display resolution is 1920x1080. Since your model accepts the padded, scaled image as input, the output is also padded. So you need to scale the valid part (removing the padding) of the output to the display resolution.
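As a quick sanity check of that mapping, here is the arithmetic for this setup (a minimal sketch assuming the standard YOLOv5 letterbox; the variable names are illustrative, not from the posted code):

    #include <algorithm>
    #include <cstdio>

    // Illustrative only: compute the valid (non-padded) region of a
    // 640x640 letterboxed output for a 1920x1080 source.
    int main() {
        const int inputW = 640, inputH = 640;    // network input
        const int imageW = 1920, imageH = 1080;  // source frame
        const float r = std::min(inputW / (float)imageW,
                                 inputH / (float)imageH);  // = 1/3, width-limited
        const int w = (int)(imageW * r);   // 640
        const int h = (int)(imageH * r);   // 360
        const int x = (inputW - w) / 2;    // 0
        const int y = (inputH - h) / 2;    // 140 rows of padding, top and bottom
        // Only this x,y,w,h window of the 640x640 mask maps back to the frame.
        std::printf("valid region: x=%d y=%d w=%d h=%d\n", x, y, w, h);
        return 0;
    }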

The final display size is 1920x1080. Yes, I rescale the mask from 640×640 to 1920×1080, taking the padding into account. The code is shown below:

    // The input mask_mat shape is 640 x 640.
    cv::Mat img_mask = scale_mask(mask_mat, kImageH, kImageW);

    // Rescale: crop the valid (non-padded) region of the letterboxed mask,
    // then resize it to the original image resolution.
    cv::Mat scale_mask(cv::Mat mask, uint32_t img_h, uint32_t img_w) {
      int x, y, w, h;
      float r_w = kInputW / (img_w * 1.0);
      float r_h = kInputH / (img_h * 1.0);
      if (r_h > r_w) {
        // Width-limited: padding was added on top and bottom.
        w = kInputW;
        h = r_w * img_h;
        x = 0;
        y = (kInputH - h) / 2;
      } else {
        // Height-limited: padding was added on left and right.
        w = r_h * img_w;
        h = kInputH;
        x = (kInputW - w) / 2;
        y = 0;
      }
      cv::Rect r(x, y, w, h);
      cv::Mat res;
      cv::resize(mask(r), res, cv::Size(img_w, img_h));
      return res;
    }
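For a direct comparison with what nvosd draws, the rescaled 0-1 float mask can also be binarized at the same threshold nvinfer uses (a sketch; the 0.3 mirrors segmentation-threshold in the config below, and idx is reused from the earlier dump snippet):

    // Hypothetical check: binarize the rescaled 0-1 mask at the nvinfer
    // threshold and dump it for a pixel-level comparison with the OSD output.
    cv::Mat bin;
    cv::threshold(img_mask, bin, 0.3, 255.0, cv::THRESH_BINARY);
    bin.convertTo(bin, CV_8UC1);
    cv::imwrite("rescaled_" + std::to_string(idx) + ".jpg", bin);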

Post-processing is done in the NvDsInferParseYolov5Seg function.

    # nvinfer config file
    cluster-mode=4
    # lib path for instance segmentation
    parse-bbox-instance-mask-func-name=NvDsInferParseYolov5Seg
    custom-lib-path=/opt/nvidia/deepstream/deepstream-6.0/instanceSeg_yolov5/nvdsinfer_yolov5_seg_impl/libnvinfer_yolov5_seg.so
    output-instance-mask=1
    segmentation-threshold=0.3
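For reference, the custom parser named above is expected to match the instance-mask parsing prototype from nvdsinfer_custom_impl.h (cluster-mode=4 selects no clustering, so the parser's output is used as-is). A minimal skeleton under that assumption, with the body elided:

    #include "nvdsinfer_custom_impl.h"

    // Skeleton only: the real implementation decodes boxes and prototype
    // masks, then fills mask/mask_width/mask_height/mask_size per object,
    // as in the snippets above. The exported name must match
    // parse-bbox-instance-mask-func-name in the nvinfer config.
    extern "C" bool NvDsInferParseYolov5Seg(
        std::vector<NvDsInferLayerInfo> const &outputLayersInfo,
        NvDsInferNetworkInfo const &networkInfo,
        NvDsInferParseDetectionParams const &detectionParams,
        std::vector<NvDsInferInstanceMaskInfo> &objectList);

    // Compile-time prototype check provided by the SDK header.
    CHECK_CUSTOM_INSTANCE_MASK_PARSE_FUNC_PROTOTYPE(NvDsInferParseYolov5Seg);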

How can you guarantee the mask is for the frame you pasted?

We are just looking at the bus object. The gray picture is saved in the NvDsInferParseYolov5Seg function, and the RGB picture is output by the pipeline into the video. Both are results for the first frame of the input video. I think the mask data is changed somewhere, but I can't locate where right now.


So the mask is correct. The issue has nothing to do with DeepStream. Please debug your code.

??? I mean the mask in the RGB image should be the same as in the gray image, but they are not the same. The RGB image is output by nvosd.

There has been no update from you for a while, so we are assuming this is no longer an issue and are closing this topic. If you need further support, please open a new one. Thanks.

Are you displaying the video with the NVIDIA-AI-IOT/deepstream_tao_apps samples (github.com)?