Equivalent of jetson.uitls.cudaFromNumpy in C++

andrea_Faction · October 5, 2021, 6:38pm

I need convert and load a cv::Mat into CUDA memory. This is what I was planning on doing but apparently the AGX’s OpenCV does not have this package:
opencv2/cudaimgproc.hpp

from the compiler:
fatal error: opencv2/cudaimgproc.hpp: No such file or directory
#include <opencv2/cudaimgproc.hpp>

My approach to cuda from numpy:

cv::Mat rgb(frame->rows, frame->cols, frame->type());
cv::cvtColor(*frame, rgb, cv::COLOR_BGR2RGB);
uint8_t *imgPtr;
cv::cuda::GpuMat cuda_img;
cuda_img.upload(rgb);
cudaMalloc((void **)&imgPtr, cuda_img.rows * cuda_img.step);
cudaMemcpyAsync(imgPtr, cuda_img.ptr<uint8_t>(), cuda_img.rows * cuda_img.step, cudaMemcpyDeviceToDevice);

dusty_nv · October 5, 2021, 6:55pm

Hi @andrea_Faction, I have a precompiled package of OpenCV 4.5 with CUDA enabled that you can install similar to how it is done in this Dockerfile:

https://github.com/dusty-nv/jetson-containers/blob/d58ce7eb0afbb3c2706fc62d26f69e7055384484/Dockerfile.ros.foxy#L88

The secondary copy you are doing (with cudaMemcpyAsync) may be unnessary - you should be able to directly access the pointer in the cv::cuda::GpuMat somehow.

Also if you are using cudaMallocMapped() from jetson-inference, you don’t need to use cv::cuda::GpuMat at all, you can simply memcpy() from the cv::Mat into the memory allocated by cudaMallocMapped()

https://github.com/dusty-nv/jetson-inference/blob/master/docs/aux-image.md#image-allocation

andrea_Faction · October 6, 2021, 4:23am

Thank you @dusty_nv . I am using cudaAllocMapped and memcpy but I am pretty sure I am not copying the data properly. When I try to use the void pointer, I get a cuda mapped memory error.

cv::Mat rgb(frame->rows, frame->cols, frame->type());
// convert to RGB to get ready for CUDA image
cv::cvtColor(*frame, rgb, cv::COLOR_BGR2RGB);
uchar3 *cuda_img = NULL;
if (!cudaAllocMapped(&cuda_img, mask_size_))
{
      /// \todo handle the error
}
std::memcpy(cuda_img, rgb.data, sizeof rgb.data);
if (net_->Process(cuda_img, mask_size_.x, mask_size_.y, IMAGE_RGB8,
                      ignore_class_)){
....
}

[TRT]    ../rtSafe/cuda/caskConvolutionRunner.cpp (317) - Cuda Error in allocateContextResources: 700 (an illegal memory access was encountered)
[TRT]    FAILED_EXECUTION: std::exception
[TRT]    failed to execute TensorRT context on device (null)

dusty_nv · October 6, 2021, 4:28pm

How are you computing mask_size_?

andrea_Faction · October 6, 2021, 10:37pm

mask_size_ = make_int2(frame->cols, frame->rows);

frame is a cv::Mat

dusty_nv · October 7, 2021, 4:48pm

My suggestion is to comment out the cv::cvtColor() and std::memcpy(), this will let you know if the error is related to the size of memory allocated by cudaAllocMapped(). The model should still run on blank memory, as long as the allocation wa big enough.

system · October 21, 2021, 4:48pm

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Handing off cudaImage object to OpenCV CUDA function? (expects CV::MAT) Jetson Xavier NX opencv	12	4106	October 18, 2021
How to copy cv::cuda::GpuMat to jetson-utils image Jetson AGX Orin opencv , cuda , jetson-inference , image-processing	4	114	May 6, 2025
Memory allocate problem of cudaFromNumpy, using with opencv Jetson Xavier NX opencv , cuda	3	1455	October 18, 2021
Translating CPU based OpenCV code to GPU based OpenCV code Jetson TX1 opencv	3	2771	October 18, 2021
How to get opencv images from argus/syncsensor sample? Jetson Nano camera , opencv	2	969	October 15, 2021
pass pointer from CU file to opencv cv::cuda::GpuMat.data Jetson Nano opencv	8	3169	September 11, 2019
Eliminate upload/download for OpenCV cuda::GpuMat using shared memory? Jetson Nano opencv	14	21379	October 14, 2021
videoSource.Capture()'s cudaImage to numpy Jetson Nano cuda , jetson-inference	9	3861	October 18, 2021
Using cv::cuda::GpuMat instead of cv::Mat in gstdsexample DeepStream SDK	8	3216	September 18, 2021
Python: cudaImage <-> OpenCV conversions very slow Jetson Nano cuda	11	931	January 25, 2023

Equivalent of jetson.uitls.cudaFromNumpy in C++

Related topics