We are trying to convert a model to TensorRT, but it fails with the error below:
We set the following torch2trt options:
=> dla=True, max_workspace_size=1GB
But they don't seem to be applied.
The conversion process is killed, but we can still get an output. That output works, and inference time is about half of what it was before conversion.
Which tool do you use for the conversion?
Is it trtexec or another tool?
The error is raised when converting a PyTorch model to TensorRT using torch2trt (GitHub - NVIDIA-AI-IOT/torch2trt: An easy to use PyTorch to TensorRT converter) in Python code.
Do I understand the question correctly?
More environment info:
Device: Jetson Orin Nano 4GB
Test PyTorch model: densenet121
JetPack version: 5.1.1
It looks like you are trying to convert the model into a DLA engine, but Orin Nano doesn't have DLA hardware.
Please set dla=False and try it again.
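For reference, a minimal torch2trt call with DLA disabled might look like the sketch below. This is only an illustration, assuming torch, torchvision, and torch2trt are installed on the Jetson; the fp16_mode flag and the 256 MB workspace value are example choices, not values from this thread.

```python
# Sketch only: torch2trt conversion with DLA disabled.
# Assumes a CUDA-capable Jetson with torch, torchvision, and torch2trt installed.
import torch
from torch2trt import torch2trt
from torchvision.models import densenet121

model = densenet121(pretrained=True).eval().cuda()
x = torch.randn(1, 3, 224, 224).cuda()  # example input tensor

model_trt = torch2trt(
    model,
    [x],
    dla=False,                   # Orin Nano has no DLA cores
    fp16_mode=True,              # optional: reduces memory use
    max_workspace_size=1 << 28,  # 256 MB, smaller than the 1 GB used above
)

torch.save(model_trt.state_dict(), "densenet121_trt.pth")
```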
There is no error message about DLA after changing the option to dla=False, thanks.
But the process is still killed, and I think the cause of this problem is running out of memory.
When I test on a Jetson Nano 4GB (CLI mode, available RAM 3.2 GB), the conversion works fine.
But on a Jetson Orin Nano 4GB (CLI mode, available RAM 2.2 GB), the process is killed.
Can I solve this problem with a torch2trt option (max_workspace_size, etc.)?
The test PyTorch model is densenet121.
Yes, you can try a smaller batch size and workspace value.
Could you also try adding some swap memory? This will help if compiling consumes host memory.
Would you mind creating some swap space to see if it helps? You will need extra disk space for the swap file.
sudo fallocate -l 8G [/media/mySSD/swapfile]
sudo chmod 600 [/media/mySSD/swapfile]
sudo mkswap [/media/mySSD/swapfile]
sudo /bin/sh -c 'echo "[/media/mySSD/swapfile] \t none \t swap \t defaults \t 0 \t 0" >> /etc/fstab'
sudo swapon -a
I tried the two methods you recommended, but swap memory didn't solve this problem, probably for the reasons below:
NVIDIA product - Jetson Nano 2gb
operating system - Linux
Issue - I have my yolov4 tiny code for object tracking running on Jetson Nano 2gb, but the issue is of lower frame rate on Jetson Nano 2gb, I have already created extra swap memory of 5.9GB on Jetson. Still while running the codes it is not using the Swap memory created and thus it is giving me lower frame rate in outputs.
I would like to have a solution on how to increase the FPS while detection and how do I make the code utilize the …
But changing the batch size worked (from 8 to 4), thanks.
The problem with changing the batch size is that inference must run twice, which takes about 1.4× longer.
I want to keep my batch size to keep my system optimized.
My question is: can you recommend another method to get more free RAM?
(Stopping certain services or other methods, etc. The current available RAM is 2.2 GB in the idle state, without the GUI.)
About 200 MB more seems to be enough to run the PyTorch-to-TensorRT conversion and inference while keeping my batch size.
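As a quick way to see how much headroom is actually free before launching a conversion, you can read MemAvailable from /proc/meminfo (a small Linux-only sketch; the helper name is just for illustration):

```python
# Report the kernel's estimate of available memory (Linux only).
def available_ram_mb():
    with open("/proc/meminfo") as f:
        for line in f:
            if line.startswith("MemAvailable:"):
                kb = int(line.split()[1])  # value is reported in kB
                return kb / 1024.0
    raise RuntimeError("MemAvailable not found in /proc/meminfo")

print(f"Available RAM: {available_ram_mb():.0f} MB")
```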
Thank you for your support.
Loading cuDNN can take 600 MB of memory or more.
There is a function called setTacticSources in TensorRT that allows deployment without loading cuDNN.
That function doesn't seem to be exposed in torch2trt.
Is it possible to use the TensorRT API directly?
This will require you to convert the model into ONNX format first.
Following your advice, I solved it in the following way:
1. PyTorch model to ONNX (torch.onnx.export())
2. I found a way to use setTacticSources inside torch2trt (I'm not sure whether it was actually applied), using the builder and config in https://github.com/NVIDIA-AI-IOT/torch2trt/blob/master/torch2trt/torch2trt.py#L654
3. ONNX to TensorRT engine with trtexec, using the option --tacticSources=-CUDNN (including the CUDNN tactic raised a warning that there was not enough memory)
So I can keep my batch size, and the available RAM increased by almost 1 GB!
Thanks for your support.
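For reference, the full trtexec invocation for step 3 might look like the following (a sketch; the file names are illustrative, and /usr/src/tensorrt/bin is the usual trtexec location on Jetson):

```shell
# Build a TensorRT engine from the ONNX file without the cuDNN tactic source.
# --tacticSources=-CUDNN removes cuDNN from the default tactic set.
/usr/src/tensorrt/bin/trtexec \
    --onnx=model.onnx \
    --saveEngine=model.engine \
    --tacticSources=-CUDNN
```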
This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.