Gstdsexample plugin is slow: does GaussianBlur run on GPU?

ynjiun · August 25, 2020, 11:46pm

per your recommendation, sudo jetson_clocks (CPU 2.3GHz, GPU 1.4GHz) and sudo tegrastats, the situation doesn’t improve that much (still see frame hiccups):
from CPU loading and GPU loading observation,
CPU avg 2765mW mem 2.9GB
GPU avg 2000mW mem 868MB

interestingly if I comment out the code change and resume back without performing Gaussian Filter in cuda (i.e. in your code #if 1 … #endif change to #if 0 … #endif), then the frame rate is much smoother:
CPU avg 2910mW mem 2.7GB
GPU avg 2028mW mem 808MB

I tried to put the same Gaussian Filter into a nvivafilter implementation like here

Then I don’t need to run sudo jetson-clocks and only use 30W ALL mode (CPU 1.2GHz, GPU 905MHz) the resulting pipeline is much much smoother (this is what I would expect running on GPU speed):
CPU avg 1275mW mem 2.2GB
GPU avg 1155mW mem 605MB

Queston: does gstdsexample.cpp have a lot overheads (e.g. unneccessary buf copying back/forth from CPU mem to GPU mem) causing such slow performance (compared with nvivafilter implementation)?

Thank you for your insights in advance.

Topic		Replies	Views
Cuda blurring filter running too slow on gstdsexample using GpuMat! DeepStream SDK opencv , cuda , gstreamer	8	1749	October 12, 2021
gstreamer NVMM <-> opencv gpuMat Jetson TX2 opencv	58	22298	October 18, 2021
Get frame in GpuMat instead of Mat - OpenCV 3.4.2 - v4l2 - Jetson TX2 Jetson TX2	25	4364	October 18, 2021
How to create opencv gpumat from nvstream? DeepStream SDK	36	15162	July 27, 2021
Performane of Deepstrem gst-dsexample plugin Jetson Xavier NX	4	543	October 18, 2021
Preprocessing of frames - gst-dsexample DeepStream SDK	6	1609	October 12, 2021
Gstdsexample pipeline order matter? DeepStream SDK cuda , gstreamer	3	1008	October 12, 2021
OpenCV with Jetson Nano Slow Webcam frame rate Jetson Nano opencv	7	3573	October 15, 2021
sharing memory between CUDA and openmax codec(TX1)? or other fast data transfer? Jetson TX1	15	3989	October 18, 2021
Unknown key 'blur-objects' for group [ds-example] DeepStream SDK	4	1018	October 12, 2021

Gstdsexample plugin is slow: does GaussianBlur run on GPU?

Related topics