CenterNet keypoint detector not giving good FPS on Jetson Xavier NX

I'm trying to use the CenterNet ResNet50 V1 FPN Keypoints 512x512 model from the TensorFlow Object Detection API.

I converted this saved_model to a TensorRT model using the TF-TRT converter, but the model size after conversion was around 800 MB, which is unusually large. While inferencing on the Xavier NX it takes more than 20 GB of RAM and almost an hour to load the model, and even after that I was only getting about 3 FPS.
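For reference, the conversion I'm doing follows roughly this TF-TRT flow (the directory names below are placeholders, not my exact paths):

# TF-TRT conversion of the Object Detection API saved_model (sketch).
from tensorflow.python.compiler.tensorrt import trt_convert as trt

params = trt.DEFAULT_TRT_CONVERSION_PARAMS._replace(max_workspace_size_bytes=1 << 30)
converter = trt.TrtGraphConverterV2(
    input_saved_model_dir="centernet_resnet50_v1_fpn_kpts/saved_model",
    conversion_params=params,
)
converter.convert()
converter.save("centernet_tftrt_saved_model")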

I have also tried ONNX Runtime, but the result is the same.

Kindly help with this.

Thanks

Hi,

How do you measure the 20 GB RAM usage?
Since the NX only has 8 GB of physical memory, does the inference also use swap memory?
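For example, printing something like the following during inference would show whether swap is being used (psutil is an assumed dependency here, not part of your current setup):

# Report physical RAM and swap usage; call this periodically while inferencing.
import psutil

ram = psutil.virtual_memory()
swap = psutil.swap_memory()
print(f"RAM  used: {ram.used / 1e9:.1f} GB / {ram.total / 1e9:.1f} GB")
print(f"Swap used: {swap.used / 1e9:.1f} GB / {swap.total / 1e9:.1f} GB")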

Also, since you get a similar result with ONNX Runtime, the high memory usage likely comes from the model itself.

Have you tried it on a desktop GPU?
If yes, could you share the memory usage on an x86 environment with us?

Thanks.

Hi AstaLLL,

Yes, I have extended the swap memory.

The model-loading memory usage has been reduced to about 4 GB with a few workarounds;
however, the FPS remains the same.
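For context, one common workaround on Jetson (not necessarily exactly what we applied here) is to stop TensorFlow from pre-allocating the whole GPU memory pool:

# Let TensorFlow grow GPU memory on demand instead of reserving it all up front;
# this matters on Jetson, where CPU and GPU share the same physical memory.
import tensorflow as tf

for gpu in tf.config.list_physical_devices("GPU"):
    tf.config.experimental.set_memory_growth(gpu, True)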

On my laptop with a 1650 Max-Q GPU it gives around 13 FPS.
I'm using the Python TensorFlow implementation.

We have successfully converted the model to ONNX format and run inference with ONNX Runtime on the Xavier NX, but there is not much improvement in FPS (3-4 FPS).
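For reference, the ONNX Runtime side is roughly along these lines (a sketch; the model path is a placeholder, and the uint8 NHWC input layout is what the Object Detection API exporter produces):

# Minimal ONNX Runtime inference sketch on the Xavier NX.
import numpy as np
import onnxruntime as ort

sess = ort.InferenceSession(
    "centernet_kpts.onnx",
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
input_name = sess.get_inputs()[0].name

# Dummy 512x512 uint8 frame in NHWC layout.
frame = np.zeros((1, 512, 512, 3), dtype=np.uint8)
outputs = sess.run(None, {input_name: frame})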

I have tried to convert the ONNX model to NVIDIA TensorRT but am getting the error below:

mirrag@mirrag-desktop:~/Downloads$ python3 createengine.py
Unsupported ONNX data type: UINT8 (2)
Traceback (most recent call last):
  File "createengine.py", line 19, in <module>
    engine = eng.build_engine(onnx_path, shape= shape)
  File "/home/mirrag/Downloads/engine.py", line 21, in build_engine
    network.get_input(0).shape = shape
AttributeError: 'NoneType' object has no attribute 'shape'
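From what I can tell, the AttributeError is a follow-on failure: parsing fails on the UINT8 input, so the network ends up with no inputs and network.get_input(0) returns None. A build step that reports the parser errors explicitly would look roughly like this (a sketch assuming the TensorRT 7/8 Python API; the file name is a placeholder):

# build_engine sketch that reports ONNX parser errors instead of failing later.
import tensorrt as trt

TRT_LOGGER = trt.Logger(trt.Logger.WARNING)

def build_engine(onnx_path):
    explicit_batch = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
    builder = trt.Builder(TRT_LOGGER)
    network = builder.create_network(explicit_batch)
    parser = trt.OnnxParser(network, TRT_LOGGER)

    with open(onnx_path, "rb") as f:
        if not parser.parse(f.read()):
            # Show why parsing failed (e.g. the unsupported UINT8 input).
            for i in range(parser.num_errors):
                print(parser.get_error(i))
            return None

    config = builder.create_builder_config()
    config.max_workspace_size = 1 << 30  # 1 GB
    return builder.build_engine(network, config)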

This is the guide we followed for the conversion: https://developer.nvidia.com/blog/speeding-up-deep-learning-inference-using-tensorflow-onnx-and-tensorrt/

Thanks

Hi,

Xavier NX is an embedded device.
It's expected that a 1650 will have better performance than a Jetson.
You can try running inference in FP16 or INT8 mode for extra acceleration.
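For example, with the TensorRT Python builder, FP16 can be enabled on the builder config roughly like this (a sketch; INT8 additionally requires a calibrator or a quantization-aware-trained model):

# Request FP16 kernels when building the engine; layers without FP16
# support fall back to FP32 automatically.
import tensorrt as trt

def enable_fp16(builder, config):
    if builder.platform_has_fast_fp16:
        config.set_flag(trt.BuilderFlag.FP16)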

The unsupported data type error is a known issue.
It is caused by the different default input data types between ONNX and TensorRT.
Please check the comment below for the solution:
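In short, the usual workaround is to rewrite the ONNX graph input from UINT8 to FLOAT before building the engine, for example with the onnx Python package (a sketch; the file names are placeholders, and a leading Cast node may need adjusting depending on the exporter version):

# Change the graph input element type so the TensorRT ONNX parser accepts it.
import onnx

model = onnx.load("centernet_kpts.onnx")
model.graph.input[0].type.tensor_type.elem_type = onnx.TensorProto.FLOAT
onnx.save(model, "centernet_kpts_float32.onnx")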

Thanks.

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.