How to set the correct config for a PyTorch model in nvinfer?

Hello,
I have trained a ResNet classifier in PyTorch and exported it to an ONNX model.

The FC layer of ResNet18 is set to:

model.fc = nn.Sequential(
        nn.Linear(model.fc.in_features, args.n_classes, bias=False),
        nn.Softmax(1),
    )

These are the normalization values I have used.

normalize = transforms.Normalize(
        mean=[0.485, 0.456, 0.406],
        std=[0.229, 0.224, 0.225])

The model reaches about 80% accuracy in training, but when I run inference (using nvinfer) on the exact same images used in training, the results are very different.

I have used the following offsets and net-scale-factor:


# RGB, torchvision = 255*[0.485;0.456;0.406]
offsets=123.675;116.28;103.53

maintain-aspect-ratio=1

#net-scale-factor=0.003921569
net-scale-factor=0.01735207357

With that in mind, I wanted to know

  1. How do I set the correct offsets and net-scale-factor to match my training normalization values?
  2. Is there any place where I can find examples of how nvinfer does the asymmetric padding when maintain-aspect-ratio=1?
  3. Is there any official documentation on using a PyTorch model as an SGIE in DeepStream?

Thanks.

Hi,

1. The preprocessing equation used in PyTorch is y = (x - mean) / std.
In DeepStream, it is y' = net-scale-factor * (x' - mean').
So please set mean' = mean and net-scale-factor = 1 / std.

One limitation is that nvinfer does not support channel-wise normalization.
You can either use the average std value or modify the source code below:

/opt/nvidia/deepstream/deepstream-5.0/sources/libs/nvdsinfer/nvdsinfer_context_impl.cpp
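With your torchvision values, the arithmetic works out as follows. This is just a quick Python sketch of the conversion (the averaged std is a workaround for the single net-scale-factor, not something nvinfer computes itself):

```python
# torchvision's Normalize runs on images already scaled to [0, 1]:
#     y = (x / 255 - mean) / std
# nvinfer runs on raw [0, 255] pixels:
#     y' = net-scale-factor * (x' - offsets)
# Equating the two gives offsets = 255 * mean and
# net-scale-factor = 1 / (255 * std).

mean = [0.485, 0.456, 0.406]
std = [0.229, 0.224, 0.225]

offsets = [255 * m for m in mean]
avg_std = sum(std) / len(std)           # ~0.226 - a single value, since
net_scale_factor = 1 / (255 * avg_std)  # nvinfer has no per-channel scale

print("offsets=" + ";".join(f"{o:g}" for o in offsets))
# offsets=123.675;116.28;103.53
print(f"net-scale-factor={net_scale_factor:.11f}")
# net-scale-factor=0.01735207357
```

These match the values you already put in your config, so your offsets and net-scale-factor look correct for the averaged-std approach.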

2. Do you mean asymmetric padding inside the network architecture, such as in a conv layer?
If so, the network definition should be identical to the one used in the training framework.

3. Below is a sample for an ONNX model.
Although it is used as a PGIE, you can follow the same steps for an SGIE.
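As a rough sketch of what the SGIE side could look like (the file names are placeholders, and you should check the exact key set against the nvinfer configuration reference for your DeepStream version):

```ini
[property]
# placeholder paths - point these at your exported model
onnx-file=resnet18_classifier.onnx
model-engine-file=resnet18_classifier.onnx_b1_gpu0_fp16.engine

# preprocessing to mirror torchvision's Normalize (averaged std)
net-scale-factor=0.01735207357
offsets=123.675;116.28;103.53
model-color-format=0

# run as a classifier SGIE on objects detected by the PGIE
network-type=1
process-mode=2
gie-unique-id=2
operate-on-gie-id=1
classifier-threshold=0.5
```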

Thanks.

Hi Aasta,

Thank you for your quick reply. I will take a look at points 1 and 3.

For asymmetric padding, I want to know what kind of input is being sent to the model.
The documentation of nvinfer’s properties says, for maintain-aspect-ratio: “Indicates whether to maintain aspect ratio while scaling input. DeepStream currently does asymmetric padding only.”

Does that mean another layer is added which does the padding? Or is it a pre-processing step done in nvinfer before the object is sent to the model? If so, is there any way to see how the image looks after the padding is added?
I simply want to verify whether the input image to nvinfer (after padding) matches my preprocessing during training in PyTorch.
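In case it is useful, this is how I currently try to replicate the padding on my side so I can compare images. It assumes (and this is exactly what I want to confirm) that nvinfer scales the frame to fit the network input while keeping the aspect ratio, then zero-pads the leftover right/bottom area:

```python
import numpy as np

def assumed_nvinfer_padding(img, net_w, net_h):
    """My guess at maintain-aspect-ratio=1: scale to fit, pad right/bottom."""
    h, w = img.shape[:2]
    scale = min(net_w / w, net_h / h)
    new_w, new_h = int(w * scale), int(h * scale)
    # nearest-neighbour resize with plain numpy indexing, just for inspection;
    # nvinfer itself scales on the GPU with its own interpolation
    ys = np.clip((np.arange(new_h) / scale).astype(int), 0, h - 1)
    xs = np.clip((np.arange(new_w) / scale).astype(int), 0, w - 1)
    resized = img[ys][:, xs]
    out = np.zeros((net_h, net_w) + img.shape[2:], dtype=img.dtype)
    out[:new_h, :new_w] = resized  # the padded region stays zero (black)
    return out

# e.g. a 1080p frame into a 224x224 classifier input
frame = np.random.randint(0, 256, (1080, 1920, 3), dtype=np.uint8)
padded = assumed_nvinfer_padding(frame, 224, 224)
print(padded.shape)  # (224, 224, 3); the bottom rows are black padding
```

If someone can confirm whether nvinfer really pads on the right/bottom (and with what fill value), I can use a dump like this to diff against the actual network input.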

Thanks