Jetson Nano: DeepStream Plugin Memory Management for OpenVX

metto · August 20, 2020, 9:11am

My setup:

• Jetson Nano
• DeepStream 5.0
• JetPack 4.4
• TensorRT 7.1

Hello fellow NVIDIA developers,
I have read the amazing documentation for “DeepStream”, “CUDA for Tegra” and “GStreamer” but failed to grasp the key concepts of shared SoC DRAM management. As far as I understand, even though the memory is physically the same, cache usage and addressing differ between Device Memory, Pageable Host Memory, Pinned Memory, Unified Memory and Surface Array. I gather that I should choose the memory type according to the device doing the processing, for example Surface Array for Jetson iGPU operations. Please correct me if I am wrong so far.
With this in mind, I am trying to write a custom plugin for DeepStream. While inspecting the provided gst-dsexample, I could not work out which type of memory GstBuffer uses initially. I also want to process frames on the iGPU using OpenVX. I know OpenVX works on the GPU, but since I don’t know the initial memory type, I am clueless about how to pass the frame data.
So basically my questions are: How do I pass a frame from GstBuffer to vximage while maximizing performance on the iGPU? Do I need to use an EglImage instance or an NvBufSurface instance?
I have also noticed the concept of zero-copy, but I don’t know if it can be applied here.
Thanks in advance. :^)

DaneLLL · August 21, 2020, 12:17am

Hi,
We don’t have experience with vximage. Is it a CUDA buffer? If yes, you may refer to the sample code:

It demonstrates calling NvBufSurfaceMapEglImage() to get an EglImage, then cuGraphicsEGLRegisterImage() and cuGraphicsResourceGetMappedEglFrame() to get a CUDA pointer. If vximage is a CUDA buffer, you can move data through that pointer.

metto · September 3, 2020, 7:56am

Hello again,
I have managed to create an OpenVX image from the surface address pointers with the code below:

  NvBufSurfaceMap (surface, frame_meta->batch_id, 0, NVBUF_MAP_READ_WRITE);
  NvBufSurfaceSyncForDevice(surface, frame_meta->batch_id, 0);

  mat_addr.dim_x = dsexample->processing_width;
  mat_addr.dim_y = dsexample->processing_height;
  mat_addr.stride_x = RGBA_BYTES_PER_PIXEL;
  mat_addr.stride_y = RGBA_BYTES_PER_PIXEL * dsexample->processing_width;

  src1 = vxCreateImage(dsexample->context,dsexample->processing_width,dsexample->processing_height,VX_DF_IMAGE_U8);
  dsexample->vxInp = vxCreateImageFromHandle(dsexample->context, VX_DF_IMAGE_RGBX, &mat_addr, (void* const*)surface->surfaceList[0].mappedAddr.addr[0], VX_IMPORT_TYPE_HOST );

  NvBufSurfaceSyncForDevice(surface, frame_meta->batch_id, 0);
  
  status = vxGetStatus((vx_reference)dsexample->vxInp);

  status1 = vxGetStatus((vx_reference)src1);

I get “0” from both status values, which indicates everything is OK, but when I try to manipulate the data using OpenVX functions I get a segmentation fault. I suspect it is because I am failing to map the GstBuffer data to an NvBufSurface instance as a read/write buffer accessible by the GPU. Is it because I don’t use EglImage? Or am I doing everything wrong?

DaneLLL · September 7, 2020, 11:03pm

Hi,
Not sure how vxCreateImageFromHandle() works. Does it work if you allocate a buffer with malloc(), like this?

void *ptr = malloc(width*height*4); // RGBA
void *ptrs[] = { ptr };             // vxCreateImageFromHandle() takes an array of plane pointers
vxCreateImageFromHandle(dsexample->context, VX_DF_IMAGE_RGBX, &mat_addr, ptrs, VX_IMPORT_TYPE_HOST);
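
A note on the fourth argument of vxCreateImageFromHandle(): OpenVX declares it as `void *const ptrs[]`, an array holding one base pointer per image plane. Passing the pixel buffer itself, cast to `void *const *`, would make the library read pixel bytes as if they were addresses, which is one plausible cause of the segmentation fault reported earlier in the thread. The sketch below illustrates the difference without any NVIDIA or OpenVX headers; `first_plane()` and `plane_array_roundtrip()` are hypothetical names invented for this illustration and are not part of any real API.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>

// Hypothetical stand-in for the library side: like the `ptrs` parameter of
// vxCreateImageFromHandle(), it treats its argument as an array of plane
// pointers and takes the base address of plane 0 from ptrs[0].
inline void *first_plane(void *const ptrs[]) {
    return ptrs[0];
}

// Allocates a packed 4-byte-per-pixel host buffer and wraps it the way the
// `ptrs` parameter expects: a one-element array holding the plane's base
// pointer. Returns true when the callee sees the buffer address as plane 0.
inline bool plane_array_roundtrip(std::uint32_t width, std::uint32_t height) {
    assert(width > 0 && height > 0);
    const std::size_t bytes = static_cast<std::size_t>(width) * height * 4;
    void *pixels = std::malloc(bytes);
    if (pixels == nullptr) {
        return false;
    }

    // One entry per plane; an interleaved RGBA/RGBX image has a single plane.
    void *ptrs[] = { pixels };
    const bool ok = (first_plane(ptrs) == pixels);

    // By contrast, passing `(void *const *)pixels` would make first_plane()
    // interpret the first bytes of pixel data as an address.
    std::free(pixels);
    return ok;
}
```

With the real API, the same one-element array would be passed where `ptrs` is expected, and it only needs to stay alive for the duration of the call that reads it.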

metto · September 8, 2020, 8:02am

Hello @DaneLLL,
thank you for your time. I have figured it out thanks to one of your older posts. For those who come across the same issue, here is the proper way to create a vxImage from an NvBufSurface.

First comes the conversion from NvBufSurface to EGLImage. The original code is from one of DaneLLL’s older posts.

1->NvBufSurfaceMemSet();
2->NvBufSurfaceMapEglImage();
3->cuGraphicsEGLRegisterImage();
4->cuGraphicsResourceGetMappedEglFrame();
5->cuCtxSynchronize();

Then:

vx_image vxInp = vxCreateImageFromHandle(context, VX_DF_IMAGE_RGBX, &vx_imagepatch_addressing, eglFrame.frame.pPitch, NVX_MEMORY_TYPE_CUDA);

After this, the NvBufSurface should be accessible to OpenVX for processing without any CPU access or OpenCV conversion.
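
For the addressing argument in the call above, the row stride is best taken from the mapped frame's pitch rather than computed as width × 4, because hardware surfaces may be padded at the end of each row. Below is a minimal sketch, assuming a single packed 4-byte RGBA plane. `MappedFrame` is a hypothetical stand-in that mirrors the `width`, `height`, `pitch` and `frame.pPitch` fields of `CUeglFrame`, and `PatchAddressing` mirrors four fields of `vx_imagepatch_addressing_t`, so the block builds without the CUDA or VisionWorks headers.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical mirror of the vx_imagepatch_addressing_t fields used here.
struct PatchAddressing {
    std::uint32_t dim_x;    // image width in pixels
    std::uint32_t dim_y;    // image height in pixels
    std::int32_t stride_x;  // bytes between horizontally adjacent pixels
    std::int32_t stride_y;  // bytes between vertically adjacent pixels
};

// Hypothetical mirror of the CUeglFrame fields used here (pitch-linear frame).
struct MappedFrame {
    void *plane[3];         // corresponds to frame.pPitch[]
    unsigned int width;     // pixels
    unsigned int height;    // pixels
    unsigned int pitch;     // bytes per row, including any padding
};

constexpr std::int32_t kRgbaBytesPerPixel = 4;

// Fills the addressing from the mapped frame. stride_y comes from the
// reported pitch, not from width * 4, so padded rows are stepped correctly.
inline PatchAddressing rgba_addressing(const MappedFrame &frame) {
    assert(frame.pitch >= frame.width * 4u);
    PatchAddressing addr{};
    addr.dim_x = frame.width;
    addr.dim_y = frame.height;
    addr.stride_x = kRgbaBytesPerPixel;
    addr.stride_y = static_cast<std::int32_t>(frame.pitch);
    return addr;
}

// Byte offset of pixel (x, y) from the plane base under this addressing.
inline std::size_t pixel_offset(const PatchAddressing &addr,
                                std::uint32_t x, std::uint32_t y) {
    return static_cast<std::size_t>(y) * static_cast<std::size_t>(addr.stride_y)
         + static_cast<std::size_t>(x) * static_cast<std::size_t>(addr.stride_x);
}
```

With the real types, the filled addressing would be passed as the `addrs` argument and `eglFrame.frame.pPitch` as `ptrs`. If the pitch is wider than width × 4, an addressing built from the width alone would drift further off on every row.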

DaneLLL · September 8, 2020, 11:07pm

Many thanks for sharing.
