Problems in optimizing the resnet18 model

dankorf1967 · November 3, 2024, 6:38pm

Hello,
I am developing an engineering project with machine vision, I am following this tutorial GitHub - NVIDIA-AI-IOT/trt_pose: Real-time pose estimation accelerated with NVIDIA TensorRT and I downloaded the weights of the resnet18 model from this link Models - JetNet

to optimize the model I get the following error, using only the conversion part with torch2trt.

Conversion Script:

import torch
from torch2trt import torch2trt, TRTModule
import trt_pose.models
import os

Configuración de parámetros para el modelo

MODEL_WEIGHTS = ‘resnet18_baseline_att_224x224_A_epoch_249.pth’ # archivo de pesos
OPTIMIZED_MODEL = ‘resnet18_baseline_att_224x224_A_epoch_249_trt.pth’ # guardar modelo optimizado
HEIGHT, WIDTH = 224, 224 # tamaño de entrada para el modelo
num_parts = 18 # número de puntos clave (keypoints) en el modelo de pose humana
num_links = 21 # número de conexiones entre puntos clave

Carga del modelo base en PyTorch

model = trt_pose.models.resnet18_baseline_att(num_parts, 2 * num_links).cuda().eval()

Preparación de un tensor de entrada de ejemplo para la conversión

data = torch.zeros((1, 3, HEIGHT, WIDTH)).cuda()

Convertir y guardar el modelo solo si el archivo optimizado no existe

if not os.path.exists(OPTIMIZED_MODEL):
# Cargar los pesos del modelo de PyTorch
model.load_state_dict(torch.load(MODEL_WEIGHTS))

# Convertir el modelo de PyTorch a TensorRT con torch2trt
model_trt = torch2trt(model, [data], fp16_mode=True, max_workspace_size=1 << 25)

# Guardar el modelo optimizado
torch.save(model_trt.state_dict(), OPTIMIZED_MODEL)

print(“Conversión a TensorRT completada y guardada como:”, OPTIMIZED_MODEL)

Error:

[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:28] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:28] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 44 * 512 / 1 = 4194304
[10/30/2024-20:51:29] [TRT] [E] 3: 1.cmap_up.0:0:DECONVOLUTION:GPU:kernel weights has count 2097152 but 4194304 was expected
[10/30/2024-20:51:29] [TRT] [E] 4: 1.cmap_up.0:0:DECONVOLUTION:GPU: count of 2097152 weights in kernel, but kernel dimensions (4,4) with 512 input channels, 512 output channels and 1 groups were specified. Expected Weights count is 512 * 4*4 * 512 / 1 = 4194304
Traceback (most recent call last):
File “conversion2.py”, line 29, in
model_trt = torch2trt(model, [data], fp16_mode=True, max_workspace_size=1 << 25)
File “/usr/local/lib/python3.6/dist-packages/torch2trt-0.5.0-py3.6-linux-aarch64.egg/torch2trt/torch2trt.py”, line 643, in torch2trt
outputs = module(*inputs)
File “/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py”, line 889, in _call_impl
result = self.forward(*input, **kwargs)
File “/usr/local/lib/python3.6/dist-packages/torch/nn/modules/container.py”, line 119, in forward
input = module(input)
File “/usr/local/lib/python3.6/dist-packages/torch/nn/modules/module.py”, line 889, in _call_impl
result = self.forward(*input, **kwargs)
File “/usr/local/lib/python3.6/dist-packages/trt_pose-0.0.1-py3.6-linux-aarch64.egg/trt_pose/models/common.py”, line 76, in forward
return self.cmap_conv(xc * ac), self.paf_conv(xp * ap)
File “/usr/local/lib/python3.6/dist-packages/torch2trt-0.5.0-py3.6-linux-aarch64.egg/torch2trt/torch2trt.py”, line 262, in wrapper
converter"converter"
File “/usr/local/lib/python3.6/dist-packages/torch2trt-0.5.0-py3.6-linux-aarch64.egg/torch2trt/converters/native_converters.py”, line 1496, in convert_mul
input_a_trt, input_b_trt = broadcast_trt_tensors(ctx.network, [input_a_trt, input_b_trt], len(output.shape))
File “/usr/local/lib/python3.6/dist-packages/torch2trt-0.5.0-py3.6-linux-aarch64.egg/torch2trt/torch2trt.py”, line 146, in broadcast_trt_tensors
if len(t.shape) < broadcast_ndim:
ValueError: len() should return >= 0

Best regards!

AastaLLL · November 4, 2024, 3:12am

Hi,

Some of our users have met a similar issue before.
Please find the topic below for discussion and solution info.

Thanks.

dankorf1967 · November 4, 2024, 7:14pm

Hi friend,

I went to the information indicated, but in my case as I am using jetson nano it gives me package version compatibility errors. To run the command “pip3 install onnx-graphsurgeon” it asks me to upgrade python to a version higher than 3.7 (no problem here), then I had problems with CMake, so I also had to upgrade. I reinstalled onnx, but now I had other problems, missing filesystem header and problems compiling numpy. That’s after updating CMake according to the following commands:

wget https://github.com/Kitware/CMake/releases/download/v3.18.4/cmake-3.18.4.tar.gz

tar -zxvf cmake-3.18.4.tar.gz
cd cmake-3.18.4
./bootstrap
make
sudo make install

This causes that now you can’t login to the terminal

AastaLLL · November 6, 2024, 6:35am

Hi,

You can check out TensorRT 8.2 so it might not require Python 3.7.
Also, the onnx graphsurgeon can be executed on other environments (ex. dGPU).
As the ONNX is a portable model format.

Thanks.

dankorf1967 · November 7, 2024, 8:55pm

Hello,
Thank you very much for your answer, but I found the pose estimation using Jetson-inference and the truth is that it works quite well, I will continue using this method.

Thanks for your help.

system · December 4, 2024, 12:59am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Converting FCN8-ResNet18 from Pytorch to TensorRT for inference on Jetson Nano TensorRT tensorrt , jetson-inference , pytorch , python , onnx	3	2349	October 12, 2021
Jetson nano road following jetracer optimization error during model conversion to trt model Jetson Nano tensorrt , cudnn , jetracer	7	146	August 1, 2024
Deploy trt_pose Jetson AGX Orin tensorrt	7	290	July 18, 2024
Error while converting my model to a TensorRT model. Not found: Container TF-TRT does not exist. (Could not find resource: TF-TRT/TRTEngineOp_0_0) TensorRT tensorrt	1	2609	December 9, 2021
Problem converting tensorflow model to TensorRT Jetson Nano tensorrt , tensorflow	5	470	March 26, 2024
Cannot Convert Custom Model To TensorRT TensorRT	10	1846	October 12, 2021
Building trtpose in x86 T4 TensorRT cudnn	20	711	July 3, 2024
Pytorch to tensorrt conversion using torch2trt on jetson nano Frameworks (archived) pytorch	1	1598	July 7, 2021
Pytorch model optimization in jetson nano Jetson Nano cuda , pytorch	7	1996	October 18, 2021
Detectnetv2 resnet18/resnet10 on jetson nano. TAO Toolkit	4	944	October 12, 2021