unspecified launch failure in function caller using OpenCV for Tegra 2.4.13

ManuKlause · November 14, 2016, 10:28am

Hi everyone,

running this code:

gpu::GpuMat cv_imgLeft_gpu;
gpu::GpuMat cv_imgLeft_gpu_blur;
Size filter_size;
filter_size.width = 7;
filter_size.height = 7;

// create filter
cv::Ptr<gpu::FilterEngine_GPU> filter = gpu::createGaussianFilter_GPU(CV_16SC1, filter_size, gauss_sigma);

while() {
// get image, function is from camera supplier, image is stored in pointer ptrGrabResultLeft
CameraLeft.RetrieveResult(5000, ptrGrabResultLeft, TimeoutHandling_ThrowException);

// constructor for GpuMatrix headers pointing to user-allocated data, load image to GPU memory
cv_imgLeft_gpu = gpu::GpuMat(ptrGrabResultLeft -> GetHeight(), ptrGrabResultLeft -> GetWidth(), CV_16SC1,  
(uint32_t*)imageLeft.GetBuffer());

// apply filter
filter->apply(cv_imgLeft_gpu, cv_imgLeft_gpu_blur, cv::Rect(0, 0, cv_imgLeft_gpu.cols, cv_imgLeft_gpu.rows));
}

produces this error output in my terminal:

OpenCV Error: Gpu API call (unspecified launch failure) in caller, 
file /hdd/buildbot/slave_jetson_tx_4/35-O4T-L4T-R24-armhf/opencv/modules/gpu/src/cuda/row_filter.h, 
line 174 terminate called after throwing an instance of 'cv::Exception' what(): 
/hdd/buildbot/slave_jetson_tx_4/35-O4T-L4T-R24-armhf/opencv/modules/gpu/src/cuda/row_filter.h:174:
error: (-217) unspecified launch failure in function caller

I am using an TX1 with OpenCV for Tegra 2.4.13. The camera takes 12 bit images within a 16 bit mask. However, if I load the image fo cv Mat format and convert it to gpu Mat, it works fine. But the time it takes to convert the image is to long, I want to avoid that. What am I missing?

Thanks for your help.

Best ManuKlause

kayccc · November 24, 2016, 12:53am

Hi ManuKlause,

When create the cv::Mat first, then the GpuMat from it, things work. This tells that you have host memory from your camera input, so it must be explicitly uploaded to GPU memory before use. You can do some hiding of this latency if you use streams with asynchronous copy. Here is a nice presentation form GTC 2013 they should review for more info:

[url]http://on-demand.gputechconf.com/gtc/2013/webinar/gtc-express-itseez-opencv-webinar.pdf[/url]

Hope this could help on your case.

Thanks

ManuKlause · November 24, 2016, 9:09am

Hi kayccc, you were right. I have contacted the supplier of the camera. It seems, the cameras is not able to write the image to GPU memory directly. So, I have to write it to the host first, then upload it. I will try using asynchronous data now. Tank you.

By using asynchronous data, I am using the overlap mode, am I right?

kayccc · November 29, 2016, 12:58am

Hi ManuKlause,

Yes, by using asynchronous transfer as outlined in the section of the presentation called “Overlapping operations with CUDA” you will be able to minimize the total latency of your application.

Thanks

Topic		Replies	Views
OpenCV convertTo Failure Jetson TX2 opencv	24	8103	October 18, 2021
No matching function for call to 'cv::cuda::CLAHE::apply(cv::gpu::GpuMat&, cv::gpu::GpuMat&) Jetson TX2	2	1042	October 18, 2021
How to create opencv gpumat from nvstream? DeepStream SDK	36	15382	July 27, 2021
OpenCV 3.3 with gpu using problem Jetson TX1	8	2585	October 18, 2021
Unexpected opencv performance on TK1, and CUDA crash Jetson TK1	4	1244	October 18, 2021
gstreamer NVMM <-> opencv gpuMat Jetson TX2 opencv	58	22547	October 18, 2021
Get frame in GpuMat instead of Mat - OpenCV 3.4.2 - v4l2 - Jetson TX2 Jetson TX2	25	4418	October 18, 2021
vc2013, X64, the program always block at GpuMat device_edge(image_h2); Jetson TK1	8	2014	June 26, 2015
Translating CPU based OpenCV code to GPU based OpenCV code Jetson TX1 opencv	3	2772	October 18, 2021
VideoWriter_GPU error Jetson TX2	4	1088	October 18, 2021

unspecified launch failure in function caller using OpenCV for Tegra 2.4.13

Related topics