Export onnx.py is failing for own camera captured data sets

dipankar123sil · December 25, 2020, 5:37pm

Following error is observed on Jetson nano 2GB while exporting onnx model. Training was successful
Traceback (most recent call last):
File “onnx_export.py”, line 86, in
net.load(args.input)
File “/jetson-inference/python/training/detection/ssd/vision/ssd/ssd.py”, line 135, in load
self.load_state_dict(torch.load(model, map_location=lambda storage, loc: storage))
File “/usr/local/lib/python3.6/dist-packages/torch/serialization.py”, line 571, in load
with _open_file_like(f, ‘rb’) as opened_file:
File “/usr/local/lib/python3.6/dist-packages/torch/serialization.py”, line 229, in _open_file_like
return _open_file(name_or_buffer, mode)
File “/usr/local/lib/python3.6/dist-packages/torch/serialization.py”, line 210, in init
super(_open_file, self).init(open(name, mode))
IsADirectoryError: [Errno 21] Is a directory: ‘models/face/’

Tried the solution provided in “https://forums.developer.nvidia.com/t/onnx-export-py-outputs-size-mismatch-for-classification”

Thers’s no newline/endline characted in both own created labels.txt or pytorch created one. There’s only one difference that pytorch generated model ha one more extra label ‘BACKGROUND’.
But I don’t expect it to be necessary as as in video tutorial “Jetson AI Fundamentals - S3E5 - Training Object Detection Models - YouTube” no class for background was created.
Can anyone help ?
Thanks & Regards,
Dipankar Sil

dipankar123sil · December 25, 2020, 8:00pm

I tried adding --input argument then the issue bounced back to similar issue of size mismatch as mentioned in “Onnx_export.py outputs size mismatch for classification_headers.0.weight / bias errors”
But i tried to diff between labels.txt created by me and then by Pytorch, following is the output
“**diff data/face/labels.txt models/face/labels.txt **
0a1
> BACKGROUND”
As we can see there’s no difference in new line character in both the files.
current error after passing --input argument is this
"Traceback (most recent call last):
File “onnx_export.py”, line 86, in
net.load(args.input)
File “/jetson-inference/python/training/detection/ssd/vision/ssd/ssd.py”, line 135, in load
self.load_state_dict(torch.load(model, map_location=lambda storage, loc: storage))
File “/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py”, line 1045, in load_state_dict
self.class.name, “\n\t”.join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for SSD:
size mismatch for classification_headers.0.weight: copying a param with shape torch.Size([126, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 512, 3, 3]).
size mismatch for classification_headers.0.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]).
size mismatch for classification_headers.1.weight: copying a param with shape torch.Size([126, 1024, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 1024, 3, 3]).
size mismatch for classification_headers.1.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]).
size mismatch for classification_headers.2.weight: copying a param with shape torch.Size([126, 512, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 512, 3, 3]).
size mismatch for classification_headers.2.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]).
size mismatch for classification_headers.3.weight: copying a param with shape torch.Size([126, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 256, 3, 3]).
size mismatch for classification_headers.3.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]).
size mismatch for classification_headers.4.weight: copying a param with shape torch.Size([126, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 256, 3, 3]).
size mismatch for classification_headers.4.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]).
size mismatch for classification_headers.5.weight: copying a param with shape torch.Size([126, 256, 3, 3]) from checkpoint, the shape in current model is torch.Size([18, 256, 3, 3]).
size mismatch for classification_headers.5.bias: copying a param with shape torch.Size([126]) from checkpoint, the shape in current model is torch.Size([18]). "
Please Help.
Thanks & Regards,
Dipankar Sil

AastaLLL · December 28, 2020, 7:51am

Hi,

Could you also add the --labels to specify your output class?

github.com

dusty-nv/pytorch-ssd/blob/e7b5af50a157c50d3bab8f55089ce57c2c812f37/onnx_export.py#L22


      
          from vision.ssd.mobilenetv1_ssd_lite import create_mobilenetv1_ssd_lite
          from vision.ssd.squeezenet_ssd_lite import create_squeezenet_ssd_lite
          from vision.ssd.mobilenet_v2_ssd_lite import create_mobilenetv2_ssd_lite
          
          

          
# parse command line
          parser = argparse.ArgumentParser()
          parser.add_argument('--net', default="ssd-mobilenet", help="The network architecture, it can be mb1-ssd (aka ssd-mobilenet), mb1-lite-ssd, mb2-ssd-lite or vgg16-ssd.")
          parser.add_argument('--input', type=str, default='', help="path to input PyTorch model (.pth checkpoint)")
          parser.add_argument('--output', type=str, default='', help="desired path of converted ONNX model (default: <NET>.onnx)")
          parser.add_argument('--labels', type=str, default='labels.txt', help="name of the class labels file")
          parser.add_argument('--width', type=int, default=300, help="input width of the model to be exported (in pixels)")
          parser.add_argument('--height', type=int, default=300, help="input height of the model to be exported (in pixels)")
          parser.add_argument('--batch-size', type=int, default=1, help="batch size of the model to be exported (default=1)")
          parser.add_argument('--model-dir', type=str, default='', help="directory to look for the input PyTorch model in, and export the converted ONNX model to (if --output doesn't specify a directory)")
          
          
args = parser.parse_args() 
          print(args)
          
          
# set the device
          device = torch.device('cuda:0' if torch.cuda.is_available() else 'cpu')

Thanks.

dipankar123sil · December 28, 2020, 10:25am

Thanks --labels worked in creating .onnx using onnx_export.py.py, but --input argument remains mandatory, else the issue mentioned in the first post remains.
So presently we need to manually note which epoch has the least loss and point that to script while creating onnx.
Thanks & Regards,
Dipankar Sil

dusty_nv · December 28, 2020, 6:26pm

PyTorch automatically adds the BACKGROUND class while training, so it is expected to find this line in the labels.txt that gets saved along with the model. That labels.txt should be used while exporting to ONNX. Normally that labels.txt with BACKGROUND does automatically get used while exporting to ONNX, I suppose except if the dataset directory and model directory was the same or the files were inadvertently copied.

Topic		Replies	Views
Python onnx_export.py shows error while trying to export model, please help Jetson Nano onnx	9	1208	January 11, 2022
Trying to regenerate onnx for Jetson Nano Jetson Nano onnx	7	1687	October 18, 2021
Evaluation of ssd mobile net Jetson Orin NX jetson-inference	2	15	February 12, 2025
Onnx_export.py outputs size mismatch for classification_headers.0.weight / bias errors Jetson Xavier NX jetson-inference	2	1896	October 18, 2021
Error while use onnx model in DS6.0 DeepStream SDK	9	734	March 8, 2022
Jetson nano start the Docker an error occurred while training your detection model ：Segmentation fault (core dumped) Jetson Nano jetson-inference	7	1234	April 21, 2022
Other Models for object detection Jetson Nano jetson-inference	18	1816	June 29, 2022
[TensorRT] ERROR: (Unnamed Layer* 0) [Convolution]: at least 5 dimensions are required for input - on Jetson Xavier Jetson Xavier NX tensorrt	7	1633	October 18, 2021
ONNX and tensorRT: ERROR: Network must have at least one output TensorRT	30	16879	October 6, 2020
Conversion of model weights for human pose estimation model to ONNX results in nonsensical pose estimation DeepStream SDK ubuntu , nvbugs , python	18	3239	October 12, 2021

Export onnx.py is failing for own camera captured data sets

Related topics