opencv4tegra vs. opencv3.1 w/ cuda

kelbyg · May 19, 2016, 8:56pm

I am developing a real-time video processing application for the TX1 using opencv. Instead of using opencv4tegra, I have been forced to use opencv 3.1 with CUDA due to the video input stream bug (https://devtalk.nvidia.com/default/topic/929483/jetson-tx1/opencv-videocapture-usb-camera/). During testing, I’ve noticed that opencv4tegra is an order of magnitude faster than opencv 3.1 with CUDA in the following convolution code:

opencv4tegra (~3ms per filter)

cv::Mat frame(360, 640, CV_8UC1);
randu(frame, cv::Scalar::all(0), cv::Scalar::all(255));

cv::Mat element = cv::getStructuringElement(cv::MORPH_RECT, cv::Size(20, 20));
cv::Mat result;
for(int i = 0; i < 100; ++i){
  // start benchmark
  cv::morphologyEx(frame, result, CV_MOP_CLOSE, element);
  //stop benchmark
}

opencv3.1 w/ CUDA (~30ms per filter)

cv::Mat frame(360, 640, CV_8UC1);
randu(frame, cv::Scalar::all(0), cv::Scalar::all(255));
cv::cuda::GpuMat frameGPU;
frameGPU.upload(frame);

cv::Mat element = cv::getStructuringElement(cv::MORPH_RECT, cv::Size(20, 20));
cv::Ptr<cv::cuda::Filter> closeFilter = cv::cuda::createMorphologyFilter(cv::MORPH_CLOSE, CV_8UC1, element);
cv::cuda::GpuMat result;
for(int i = 0; i < 100; ++i){
  // start benchmark
  closeFilter->apply(frameGPU, result);
  // stop benchmark
}

Any idea what could be causing this huge discrepancy? I am accounting for the CUDA initialization time.

Honey_Patouceul · July 28, 2016, 10:25pm

It may depend heavily on the build options. I would advise to get the build options of OpenCV4Tegra (there is a function that gives the options used to build it). Skip the definition HAVE_TEGRA_OPTIMIZATION, as it requires non open source files.
Check also for processor flags (get your target processor features with the flags of /proc/cpuinfo) and enable them in the cmake config.

Topic		Replies	Views
OpenCV Benchmarks: opencv_perf_gpu Jetson TX1 opencv	2	2655	March 28, 2016
OpenCV 3.1 with USB camera support Jetson TX1 opencv	9	9335	July 16, 2018
Opencv4Tegra GPU vs CPU TK1 vs TX1 Jetson TX1 opencv	3	3669	April 28, 2016
How to use OpenCV with CUDA support Jetson TX1	2	1509	October 18, 2021
OpenCV4Tegra doesn't support GPU Jetson TX2	11	864	October 18, 2021
Building OpenCV from source Jetson AGX Xavier opencv	3	627	October 18, 2021
OpenCV4Tegra Problems CUDA Programming and Performance opencv	1	464	June 13, 2017
OpenCV Tegra optimizations needed 60 FPS Jetson TX2	3	507	October 18, 2021
Visionworks : Test about energy efficiency and performance Jetson TX1	4	775	October 18, 2021
Opencv video processing is slow Jetson TX1	6	2227	October 18, 2021

opencv4tegra vs. opencv3.1 w/ cuda

Related topics