Dimension mismatch error in Jetson-inference ssd model training

ranjeet_k · December 25, 2021, 3:40am

I was trying to follow your object detection tutorial for mobileNet SSD. I was getting poor detection with input image size 300. I change the size to 600 in configuration file in “vision/ssd/config” folder. I am getting the following error that I am unable to track.

Traceback (most recent call last):
File “train_ssd.py”, line 361, in
device=DEVICE, debug_steps=args.debug_steps, epoch=epoch)
File “train_ssd.py”, line 126, in train
regression_loss, classification_loss = criterion(confidence, locations, labels, boxes) # TODO CHANGE BOXES
File “/home/dtu-project2/.local/lib/python3.6/site-packages/torch/nn/modules/module.py”, line 1051, in _call_impl
return forward_call(*input, **kwargs)
File “/media/dtu-project2/2GB_HDD/jetson-inference/python/training/detection/ssd/vision/nn/multibox_loss.py”, line 38, in forward
mask = box_utils.hard_negative_mining(loss, labels, self.neg_pos_ratio)
File “/media/dtu-project2/2GB_HDD/jetson-inference/python/training/detection/ssd/vision/utils/box_utils.py”, line 202, in hard_negative_mining
loss[pos_mask] = -math.inf
IndexError: The shape of the mask [4, 3000] at index 1 does not match the shape of the indexed tensor [4, 11658] at index 1

The mask shape is [4, 3000] while training batch shape is [4, 11658]. The mask shape is coming from shape of labels. I am unable to track where is the labels’s shape getting set to [4, 3000]. Pls, guide me on how to do training with different input sizes.

Also, I could not find the pertaining model file for VGG-SSD on the website. Can you give the link for that?

Thank you in advance
Ranjeet

dusty_nv · January 12, 2022, 6:50pm

Hi @ranjeet_k, sorry that I didn’t see this post earlier (moving this to the Jetson forums)

You also need to change the SSD box specs in the code - please see these posts for changing it to 512x512:

system · February 2, 2022, 6:00am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
How train jetson-inference ssd512 model Jetson TX2 jetson-inference , ssd , pytorch	14	3103	October 18, 2021
Training of Object Detection models on Jetson Nano! Jetson Nano ai-training	7	1301	October 18, 2021
Jetson Nano - train_ssd.py example doesn't detect anything Jetson Nano ai-training	8	1593	October 15, 2021
Re-training SSD-Mobilenet: gt_locations consist of nan values which causing Regression Loss to NaN Jetson Nano ai-training	2	929	September 13, 2022
Train_ssd.py indices error Jetson Nano jetson-inference	12	1737	December 15, 2021
Jetson-inference Jetson Nano jetson-inference	8	63	May 7, 2025
Successful training with "train_ssd.py" using small custom data set, but error on full data set Jetson Nano ai-training	6	1839	October 18, 2021
Retraining ssd_mobilenet on Jetson nano Jetson Nano neural-network-framework	12	1523	October 15, 2021
Train_ssh.py only works with one dataset; other one returns Loss: nan Jetson Nano ai-training	4	635	October 15, 2021
Bad object detection predictions on SSD Mobilenet v2 lite Jetson Nano jetson-inference , ssd , pytorch , deep-learning	4	2053	January 6, 2022

Dimension mismatch error in Jetson-inference ssd model training

Related topics