Jetson AGX Xavier GPU RAM usage for object detection and instance segmentation inferencing

user152354 · May 12, 2022, 11:47am

Hey there,

I am using a Jetson AGX Xavier with 16GB RAM memory for object detection and instance segmentation inferencing. However the algorithms are running very slowly compared to standard desktop computers with 8GB GPU.

I have run jtop to check if the GPU is properly utilized. Every time the Program makes a prediction, the GPU usage goes up to 100%. However, under the memory registry of jtop, I notice that the GPU always only utilizes about 2GB of RAM memory when performing inference.

Should I consider it normal that the GPU “only” uses 2GB of RAM or can I somehow make it use more at the time?

To clarify, I am running the inferencing with a batch size of 1 because I want to use it for real-time application on an image stream.
I am using a swin-transformer model for object-detection and instance segmentation. I have taken it from MMDetection which is pytorch based. Running it on the Jetson on using the API of MMDetection(pytorch), I am getting an inference time of 1s per image (1 fps). Then I converted it to ONNX and ran it in onnxruntime-gpu. Which led to a speedup where it currently runs at 0.5s per image (2 fps). But it’s still very slow for real-time applications. I am trying to convert the model further to TensorRT but it’s proving to be quite a challenge and I am unsure if it will help that much.

For that reason, I was asking myself if the problem might be somehow hardware/gpu related.

Any help or advice would be much appreciated!

AastaLLL · May 13, 2022, 3:31am

Hi,

It seems that you are using a third-party framework (PyTorch?) for inference. Is that correct?
If yes, it’s recommended to check with the provider to see if they do any optimization for Jetson.

Another alternative is to run the model with TensorRT.
For an ONNX model, you can convert it to TensorRT with our binary directly:

$ /usr/src/tensorrt/bin/trtexec --onnx=[model]

Thanks.

system · June 1, 2022, 4:07am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Pytorch network -> onnx -> tensorrt performance(run frequency) question Jetson AGX Xavier tensorrt	7	586	December 7, 2023
Detection tesnorRT takes seconds to run on TX2 Jetson TX2 tensorrt , jetson-inference	8	787	October 18, 2021
Jetson Xavier NX not ble to run segmentation using GPU Jetson Xavier NX tensorflow , python , deep-learning , segmentation	4	1458	August 29, 2021
TensorRT model consuming more amount of RAM Jetson TX2 tensorrt	3	971	October 18, 2021
TensorFlow object detection inference out of memory Jetson Nano	7	3206	October 18, 2021
Using ONNX Runtime with TensorRT on Jetson Devices Jetson AGX Xavier tensorrt	5	1232	October 18, 2021
TensorRT model consuming more amount of RAM on Jetson TX2 Jetson TX2 tensorrt	5	1216	October 18, 2021
Slow object detection speed Xavier AGX 32GB Jetson AGX Xavier tensorrt , tensorflow	6	1349	October 18, 2021
Extremely slow inference in TensorRT for live semantic segmentation model Jetson AGX Xavier tensorrt , tensorflow , jetson-inference	11	4560	April 12, 2022
Jetson AGX Orin GPU Usage Jetson AGX Orin performance	3	3032	June 14, 2022

Jetson AGX Xavier GPU RAM usage for object detection and instance segmentation inferencing

Related topics