Trt_pose Boost performance on Jetson Xavier

ismael.ghouddana · May 7, 2020, 9:33am

Hi Nvidia team,

I m working on the git repository of trt_pose (GitHub - NVIDIA-AI-IOT/trt_pose: Real-time pose estimation accelerated with NVIDIA TensorRT), which is amazing, using Jetson Xavier and ROS and I would like to ask you some questions about it:

How did you get an fps number of 251, the maximum I can get is 30 using jetson clocks?
How can I use bigger models like Resnet50? Are you planning to provide them?
Will the performance of the Resnet18 model decrease if I increase the resolution to 640x480?
How can I improve the detection of human keypoints during movement or long distances?

Kind Regards,

Thanks

jaybdub · May 12, 2020, 6:29pm

Hi ismael.ghouddana,

Thanks for reaching out!

Question 1 - Framerate

This was measured a while ago using the following code

t0 = time.time()
torch.cuda.current_stream().synchronize()
for i in range(50):
    output = model_trt(input)
torch.cuda.current_stream().synchronize()
t1 = time.time()

print(50.0 / (t1 - t0))

30FPS sounds low. Do you mind sharing your system configuration?

What power mode is the Jetson in (you can find this with nvpmodel -q)
What version of JetPack are you using?

Question 2 - Bigger models

Currently, there are no plans to provide bigger models. The current goal was targeting usable accuracy within several meters, at high framerates. However, I’m curious what issue you’re running into with the existing models. This would help me better understand use cases we may not be addressing.

Question 3 - Larger resolutions

Yes, you can expect the framerate will drop as you increase the resolution. Depending on your application, this may be acceptable. For example, if you’re monitoring a larger group of individuals farther away and extremely high framerates are not necessary, you may benefit from increasing the resolution.

Question 4 - Accuracy during movement

One way to improve the accuracy over time may be to use a Kalman filter for each keypoint

Perform Kalman prediction step for each keypoint. This uses motion model for keypoint to give estimate of keypoint in new frame.
Detect keypoints for all individuals in current frame
Match new detections to detections in previous frame.
Perform Kalman update step for each keypoint that has a match in new frame. This refines estimate of keypoint in current frame. Keypoints without match would rely on prediction from previous frame.

This is one way to incorporate data from previous frames. I’m really not sure how well this would work in this context though since I haven’t tested.

Please let me know if this helps or you have any other questions.

Best,
John

Topic		Replies	Views
Can trt_pose run on dGPU? Jetson AGX Xavier tensorrt	5	461	October 18, 2021
Real time Human Pose detection on Jetson AGX Xavier Jetson Projects tensorrt , cuda , deep-learning-profiler	1	1286	November 25, 2020
Have anyone body 25 model tensorrt Jetson AGX Xavier tensorrt	4	845	October 18, 2021
Pose estimation using TRT (trt_pose) - slightly lower framerates than stated in inference Jetson Nano tensorrt	12	3800	October 15, 2021
Slow object detection speed Xavier AGX 32GB Jetson AGX Xavier tensorrt , tensorflow	6	1288	October 18, 2021
Hyperpose Performance on Xavier NX Jetson Xavier NX tensorrt	4	698	October 18, 2021
AGX Xavier Openpose at 3 FPS. What am I doing wrong? Jetson AGX Xavier jetson-inference	2	1059	October 18, 2021
Object Detection models and FPS Jetson TX2	3	1104	September 18, 2019
Best accuracy model for the Jetson Xavier Jetson AGX Xavier	8	917	October 18, 2021
Slow inference using tensorrt sampleFasterRCNN, 320ms/frame Jetson TX2	5	1470	October 18, 2021

Trt_pose Boost performance on Jetson Xavier

Question 1 - Framerate

Question 2 - Bigger models

Question 3 - Larger resolutions

Question 4 - Accuracy during movement

Related topics