Hey Team NVIDIA,
I have recently been using a Jetson Nano 2GB for model deployment. I have converted a custom model (a pose estimation application) from PyTorch to ONNX and .trt, and I am trying to run live inference on the Jetson Nano.
I have not used other custom models with it, and doing so would likely require additional pre/post-processing code to support your model.
Alternatively, you may want to try the torch2trt tool, which you can integrate directly with your PyTorch scripts to accelerate your model with TensorRT without many changes.
Hey, thanks for getting back, and also for the great tutorials online. I came across this repo and tried to use torch2trt with my model, and the Jetson got stuck for a long time. So I tried some sample code from the torch2trt repo just as a starter, and it fails to convert and returns:
Segmentation fault (core dumped)
import torch
from torch2trt import torch2trt
from torchvision.models.alexnet import alexnet
# create some regular pytorch model...
model = alexnet(pretrained=True).eval().cuda()
# create example data
x = torch.ones((1, 3, 224, 224)).cuda()
# convert to TensorRT feeding sample data as input
model_trt = torch2trt(model, [x])
torch.save(model_trt.state_dict(), 'alexnet_trt.pth')
I have managed to convert my model to .trt via the ONNX format, but when I use OpenCV or jetson-utils to run live inference with the model, it exits. How can I overcome this issue?
You would need to modify jetson-inference to use the pre/post-processing that your model expects. In my experience, there can be significant post-processing for pose estimation models. It may be easier for you to just use something like ONNX Runtime and keep your existing Python application for the pre/post-processing.
Do you mean ONNX Runtime with CUDA? I have a working ONNX solution and tried it on a Raspberry Pi 4GB, but it's slow. I also tried running PyTorch directly with CUDA on the Jetson Nano, and it failed.
For example: my model takes in a tensor of shape (1, 3, 256, 192), outputs a tensor of shape (1, 18, 48, 48), and then I take the max values in the heatmaps and apply thresholding. If I have to put this ONNX model through the detectnet example (or a pose estimation model), how can I modify jetson-inference? I did not find much documentation online on how to do this.
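(For context, a minimal sketch of that max-plus-threshold step over the (1, 18, 48, 48) heatmaps might look like the following; the helper name and threshold value are only illustrative, not my exact code.)

import numpy as np

def heatmaps_to_keypoints(heatmaps, threshold=0.3):
    """Pick the peak of each joint heatmap and drop low-confidence joints.

    heatmaps: array of shape (1, 18, 48, 48); the threshold is illustrative.
    """
    keypoints = []
    for hm in heatmaps[0]:                                # iterate over the 18 joint heatmaps
        y, x = np.unravel_index(np.argmax(hm), hm.shape)  # location of the peak response
        score = float(hm[y, x])
        keypoints.append((int(x), int(y), score) if score >= threshold else None)
    return keypoints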
Yes, if you set the ONNX Runtime execution provider to CUDA while running it on the Jetson, it should be faster.
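Roughly, selecting the CUDA execution provider in ONNX Runtime looks like this (it requires an onnxruntime build with CUDA support; the model path below is a placeholder, and the input shape is taken from the post above):

import numpy as np
import onnxruntime as ort

# Prefer the CUDA execution provider; fall back to CPU if it is unavailable.
# "pose_model.onnx" is a placeholder path for the exported model.
session = ort.InferenceSession(
    "pose_model.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)

input_name = session.get_inputs()[0].name
dummy = np.random.rand(1, 3, 256, 192).astype(np.float32)  # matches the input shape above
heatmaps = session.run(None, {input_name: dummy})[0]        # expected shape (1, 18, 48, 48)
print(heatmaps.shape)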
Your model is a pose estimation model, so it wouldn’t run through detectnet. Since your model is of a different architecture, you would need to modify the pre/post-processing here:
Hey dusty_nv,
I used a stacked hourglass model this time and converted it to ONNX (CUDA), TensorRT, and torch2trt. I was able to do this successfully by running a random input tensor of shape (1, 3, 256, 256) through the model for 50 iterations, at about 0.025 s per iteration. The problem arises when I call the inference with the camera (code below / OpenCV) and try to pass in the image: the terminal stops, showing a RAM-too-low problem:
How can I call a live camera via jetcam on the terminal (references would help)? Thanks
import jetson.inference
import jetson.utils

net = StackedHourglass().eval().cuda()  # example: placeholder for my pose model

camera = jetson.utils.videoSource("csi://0")       # '/dev/video0' for V4L2
display = jetson.utils.videoOutput("display://0")  # 'my_video.mp4' for file

while display.IsStreaming():
    img = camera.Capture()
    # preprocessing done here ...
    detections = net(img)  # passing the image to the model
    display.Render(img)
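A rough sketch of what the commented-out preprocessing step could look like, assuming the model expects a normalized (1, 3, 256, 256) float tensor (the normalization constants here are placeholders, not the values the model was actually trained with):

import cv2
import numpy as np
import torch
import jetson.utils

def preprocess(cuda_img):
    """Convert a jetson.utils cudaImage into a (1, 3, 256, 256) float tensor."""
    jetson.utils.cudaDeviceSynchronize()               # make sure the frame is ready for CPU access
    frame = jetson.utils.cudaToNumpy(cuda_img)         # HWC, RGB
    frame = cv2.resize(frame[:, :, :3], (256, 256))    # match the model's input resolution
    frame = frame.astype(np.float32) / 255.0           # scale to [0, 1]
    frame = (frame - 0.5) / 0.5                        # placeholder normalization
    tensor = torch.from_numpy(frame).permute(2, 0, 1)  # HWC -> CHW
    return tensor.unsqueeze(0).cuda()                  # add batch dim, move to GPU

The resulting tensor would then be passed to net(...) in place of the raw cudaImage, and the heatmap output could go through the argmax/threshold step sketched earlier.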