Using mediapipe gpu accelerated python solutions with cuda accelerated opencv

ajinkyabhagat94.ab · October 25, 2021, 10:48am

I built mediapipe facemesh solution with gpu support on Jetson Xavier AGX with reference to https://forums.developer.nvidia.com/t/mediapipe/121120/33
After obtaining desired landmarks I also used OpenCV with CUDA support to develop my own code to detect iris of the eye.

JetPack version: 4.5.1
OpenCV version: 4.1.1
Mediapipe version: 0.8
Python version: 3.6.9
Cuda: 10.2

However despite using GPU support and CUDA accelerated OpenCV, the maximum FPS I have attained is around 14. I am using an USB camera which can give me maximum 30 fps for image size 640*480. How do I optimise my code?

AastaLLL · October 26, 2021, 2:36am

Hi,

Could you monitor the GPU utilization to see if all the GPU resources are used?

$ sudo tegrastats

More, have you maximized the device performance first?

$ sudo nvpmodel -m 0
$ sudo jetson_clocks

Could you also share how do you read the camera? Is it cv::VideoCapture()?

Thanks.

ajinkyabhagat94.ab · October 26, 2021, 4:51am

Thank you for the prompt reply!
After changing the power mode to maximize device performance, no change was observed.
This is the output of tegrastats while running the code. We can clearly see the GPU being utilized.

Yes I am reading the camera using cv2.VideoCapture(0).
I am calculating exponentially decaying average of fps using time module by python.
For reference -

current_fps = 1.0/(toc - tic)

fps = fps*0.95 + current_fps*0.05

In the above code
tic = time at the start of the code
toc = time after processing (at the end of the code)
current_fps = fps for this frame
fps = decaying average of fps

I followed this method to give us an idea of the overall average, instead of just the current frame.

AastaLLL · November 5, 2021, 5:58am

Hi,

Based on the log, the GPU utilization is low.
You can try to read the camera with GStreamer to see if it helps.

Below is an example for your reference:

Thanks.

ajinkyabhagat94.ab · November 15, 2021, 11:37am

Hi,
So I built OpenCV from source with Cuda support as you can see below -

Still there was no significant improvement.
Do you mean using gstreamer with OpenCV? Because I even tried that, again no significant improvement.
It will be very helpful if you could provide links for implementing gstreamer or jetson_multimedia_api.
Thanks!

DaneLLL · November 24, 2021, 3:36am

Hi,
Could you try this sample and check if you can achieve 30fps:
V4l2src using OpenCV Gstreamer is not working in Jetson Xavier NX - #3 by DaneLLL

It is a sample for showing camera preview. would like to know it this achieves target frame rate.

system · December 8, 2021, 3:37am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
OpenCV GStreamer Capture really slow Jetson Xavier NX opencv , gstreamer	4	5059	October 18, 2021
Speed up python USB camera read in python without opencv Jetson Xavier NX camera , opencv	3	2181	May 18, 2022
Built OpenCV with CUDA / GPU support. Need to display via gstreamer in python. Jetson Nano	2	3144	October 15, 2021
OpenCV on NVIDIA Jetson Xavier Jetson AGX Xavier opencv	5	2077	September 27, 2021
Reading CSI camera input directly to GPU memory Jetson Nano camera , gstreamer , jetson-nano	5	2813	December 4, 2022
Need advice: 4K video capture & writing performance with OpenCV Jetson Xavier NX opencv , gstreamer	6	2215	March 2, 2022
OpenCV extremely slow with gstreamer camera Jetson Nano opencv , gstreamer	6	2610	September 19, 2021
GPU Acceleration Support for OpenCV Gstreamer Pipeline Jetson Xavier NX opencv , gstreamer	17	8209	October 18, 2021
Nano not using GPU with gstreamer/python. Slow FPS, dropped frames Jetson Nano	22	16326	October 14, 2021
OpenCV - gstreamer use 100% CPU and 0% GPU Jetson Nano opencv , gstreamer	4	2327	October 15, 2021

Using mediapipe gpu accelerated python solutions with cuda accelerated opencv

Related topics