Advice of semantic segmentation on agx

fcj · June 17, 2021, 1:15am

I am looking for a semantic segmentation model which can run real time on AGX platform.
The input image resolution is 1920x1080.
Is any benchmark of semantic segmentation model running on AGX platform available?

Thanks,
FC
.

AastaLLL · June 17, 2021, 3:01am

Hi,

Please check the following tutorial for information:

github.com

dusty-nv/jetson-inference/blob/master/docs/segnet-console-2.md

<img src="https://github.com/dusty-nv/jetson-inference/raw/master/docs/images/deep-vision-header.jpg" width="100%">
<p align="right"><sup><a href="detectnet-example-2.md">Back</a> | <a href="segnet-camera-2.md">Next</a> | </sup><a href="../README.md#hello-ai-world"><sup>Contents</sup></a>
<br/>
<sup>Semantic Segmentation</sup></s></p>

# Semantic Segmentation with SegNet
The next deep learning capability we'll cover in this tutorial is **semantic segmentation**.  Semantic segmentation is based on image recognition, except the classifications occur at the pixel level as opposed to the entire image.  This is accomplished by *convolutionalizing* a pre-trained image recognition backbone, which transforms the model into a [Fully Convolutional Network (FCN)](https://arxiv.org/abs/1605.06211) capable of per-pixel labeling.  Especially useful for environmental perception, segmentation yields dense per-pixel classifications of many different potential objects per scene, including scene foregrounds and backgrounds.

<img src="https://github.com/dusty-nv/jetson-inference/raw/pytorch/docs/images/segmentation.jpg">

[`segNet`](../c/segNet.h) accepts as input the 2D image, and outputs a second image with the per-pixel classification mask overlay.  Each pixel of the mask corresponds to the class of object that was classified.  [`segNet`](../c/segNet.h) is available to use from [Python](https://rawgit.com/dusty-nv/jetson-inference/pytorch/docs/html/python/jetson.inference.html#segNet) and [C++](../c/segNet.h).  

As examples of using the `segNet` class, we provide sample programs C++ and Python:

- [`segnet.cpp`](../examples/segnet/segnet.cpp) (C++) 
- [`segnet.py`](../python/examples/segnet.py) (Python) 

These samples are able to segment images, videos, and camera feeds.  For more info about the various types of input/output streams supported, see the [Camera Streaming and Multimedia](aux-streaming.md) page.

See [below](#pretrained-segmentation-models-available) for various pre-trained segmentation models available that use the FCN-ResNet18 network with realtime performance on Jetson.  Models are provided for a variety of environments and subject matter, including urban cities, off-road trails, and indoor office spaces and homes.

This file has been truncated. show original

The fcn-resnet18-cityscapes-2048x1024 model can achieve 47fps for 2048x1024 input on Xavier.

Thanks.

fcj · June 17, 2021, 5:11am

I have tried fcn-resnet18-cityscapes-2048x1024 model. Unfortunately, the output segmentation map is 1/16 of input image resolution which is not suitable to my application.
My application requires that segmentation output has the same resolution as input image.

Thanks,
FC

dusty_nv · June 17, 2021, 2:28pm

Hi @fcj, the output will be rescaled using bilinear or nearest-neighbor interpolation to whatever size output image you feed into segNet. The raw grid on FCN segmentation models is typically always a fraction of the size of the input, and it just gets upsampled to the original size. The only difference is that I perform the upsampling manually in CUDA because it is faster than the way that PyTorch does the upsampling in the original model.

fcj · June 17, 2021, 6:11pm

Hi, @dusty_nv

object boundary is quite blurry by upscaling 16 times of segmentation which is not acceptable to my application. Most semantic segmentation network has a decoder in the network which can do upsampling without blurring object boundary.

Thanks,

dusty_nv · June 17, 2021, 6:36pm

The PyTorch FCN-ResNet segmentation models use an upsample instead of decoder, but still that part is often the slowest part and may slow it down to being sub-realtime, particularly on HD resolution. If you were to use a model from PyTorch that still has the upsample/decoder built-in, you could use that instead.

system · June 25, 2021, 7:29am

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Semantic Segmentation with SegNet Jetson Nano jetson-inference	5	1528	October 18, 2021
Will there be a centralised workflow for creating custom models for semantic segmentation on Jetson devices? Jetson Nano machine-learning , segmentation	4	215	May 21, 2024
Dusty-nv sgmentation threshold setting option? Jetson Nano jetson-inference	6	383	June 14, 2023
The output result of segnet-console.py is blurred. Jetson Nano	3	666	October 15, 2021
Train Custom image Segmentation Model jetson_inference Jetson Nano ai-training	4	3182	October 15, 2021
Deeplab v3 semantic segmentation on TX2 Jetson TX2 jetson-inference	4	1409	October 18, 2021
How to run Schematic Segmentation samples in Nano Jetson Nano	18	3890	October 18, 2021
Unable to get segmentation to work with Jetson TX2 Jetson TX2	25	6632	October 18, 2021
Live Camera Segmentation Jetson AGX Xavier jetson-inference	2	762	April 13, 2022
Face image segmentation for bicolor 3D printing Jetson Nano jetson-inference	4	35	February 20, 2025

Advice of semantic segmentation on agx

Related topics