Camera DMA buffer to VPIImage as efficiently as possible

I am using libargus to capture frames via DMA from a stereo camera. I am trying to construct a VPIImage from the frame, accessible to the VIC (bonus points for OFA and PVA as well) in the most efficient way possible (preferably ISP only, no CPU or VIC utilization to do so).

Potential input paths:

  1. The DMA buffer file descriptor. Not sure if there’s anything I can do with this directly, but it seems plausible that I should be able to wrap it with a VPIImage. Or perhaps first wrap it in NvBuffer, then wrap with VPIImage.

  2. An EGLStream::Frame. I can get this by configuring the argus OutputStream as STREAM_TYPE_EGL, then grabbing frames via a consumer.

    EGLStream::IFrameConsumer* consumer =
        Argus::interface_cast<EGLStream::IFrameConsumer>(cam.consumer);
    Argus::UniqueObj<EGLStream::Frame> frame(consumer->acquireFrame());

From this I can copy to an NvBuffer using IImageNativeBuffer::copyToNvBuffer, as is done in the libargus samples. The NvBuffer can then (at least in theory) be converted to a VPIImage, though I haven't figured out the NvBuffer → VPIImage conversion yet. The performance of this option is poor even before the VPIImage is created: copying the Frame to an NvBuffer uses ~15% of the VIC per camera @30fps 1920x1200. I have 8 cameras and other algorithms I want to run on the VIC, so the copy isn't tenable.
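For what it's worth, VPI 2.x appears to allow wrapping an NvBuffer dmabuf fd directly, without a pixel copy. Below is a minimal sketch of that path; it assumes `dmabuf_fd` came from NvBufferCreate/copyToNvBuffer in a layout and format VPI accepts (e.g. NV12), and it omits error handling:

```cpp
#include <cstring>
#include <vpi/Image.h>

// Hedged sketch: wrap an existing NvBuffer (dmabuf fd) as a VPIImage in
// VPI 2.x so the VIC can consume it without copying pixels. dmabuf_fd is
// assumed to reference a buffer whose layout/format VPI supports.
VPIImage wrap_nvbuffer(int dmabuf_fd)
{
    VPIImageData data;
    std::memset(&data, 0, sizeof(data)); // zero unused fields

    data.bufferType = VPI_IMAGE_BUFFER_NVBUFFER;
    data.buffer.fd  = dmabuf_fd;

    VPIImage image = nullptr;
    vpiImageCreateWrapper(&data, nullptr, VPI_BACKEND_VIC, &image);
    return image;
}
```

This still leaves the copyToNvBuffer cost in place, so it only removes the NvBuffer → VPIImage step, not the VIC copy itself.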

  3. EGLImageKHR. This is the most efficient method I have found so far. I can accomplish this by configuring the stream as follows:
    Argus::UniqueObj<Argus::OutputStreamSettings> stream_settings(
        isession->createOutputStreamSettings(Argus::STREAM_TYPE_BUFFER));
    auto istream_settings =
        Argus::interface_cast<Argus::IBufferOutputStreamSettings>(stream_settings);
    istream_settings->setBufferType(Argus::BUFFER_TYPE_EGL_IMAGE);

Acquiring the filled buffer returns an Argus::Buffer which can be cast to EGLImageKHR. Wrapping in a VPIImage can then be done like so:

VPIImage vpi_image = nullptr;
VPIImageData data;
memset(&data, 0, sizeof(data)); // zero unused fields
data.bufferType = VPI_IMAGE_BUFFER_EGLIMAGE;
data.buffer.egl = egl_image;
vpiImageCreateWrapper(&data, nullptr, VPI_BACKEND_VIC | VPI_RESTRICT_MEM_USAGE, &vpi_image);

Calling vpiImageCreateWrapper uses ~5-10% of the CPU @30fps 1920x1200. Are there any tweaks I can make to this method, perhaps to the buffer allocation, that would allow the buffers to be used on the VIC (or other accelerators) without any memory operations when wrapping? Maybe I am being overly optimistic, but video encode can be done without the CPU, so I was hopeful I could feed images directly to the VIC, OFA, etc. without the CPU as well.

Any guidance is appreciated!

Hi,

Please find below the nvargus ↔ VPI sample:
https://elinux.org/Jetson/L4T/TRT_Customized_Example#VPI_with_Argus_Camera_-_nvarguscamerasrc

The sample wraps the VPI Image from NvBuffer.
Please note that the API changed in VPI 2.x/3.x, but the overall wrapping approach is similar:

https://docs.nvidia.com/vpi/2.3/group__VPI__Image.html#ga3e7cf2520dd568a7e7a9a6876ea7995c

Thanks.

This uses the IImageNativeBuffer::copyToNvBuffer that I’ve already found to be inefficient. I suppose that means I’ve already figured out the optimal solution (i.e. option 3 with EGLImageKHR)?

Also, this example is even worse, because it appears to allocate an entirely new buffer for the VPI image instead of wrapping (but I can't test it because I have VPI 2).

Figured it out. The gst-nvarguscamera example performs poorly, but using EGLImageKHR, creating the first VPIImage with vpiImageCreateWrapper, and updating subsequent frames with vpiImageSetWrapper performs well.
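For anyone landing here later, the pattern above boils down to paying the wrapper-creation cost once and then retargeting the same VPIImage at each new buffer. A minimal sketch, assuming `on_frame` is a hypothetical per-frame callback that receives the EGLImageKHR for the acquired Argus::Buffer:

```cpp
#include <cstring>
#include <EGL/egl.h>
#include <EGL/eglext.h>
#include <vpi/Image.h>

// Wrap-once / rewrap-per-frame sketch. The first frame pays the full
// vpiImageCreateWrapper cost; later frames only redirect the existing
// wrapper with vpiImageSetWrapper, which is cheap.
static VPIImage vpi_image = nullptr;

void on_frame(EGLImageKHR egl_image)
{
    VPIImageData data;
    std::memset(&data, 0, sizeof(data)); // zero unused fields
    data.bufferType = VPI_IMAGE_BUFFER_EGLIMAGE;
    data.buffer.egl = egl_image;

    if (vpi_image == nullptr) {
        // First frame: create the wrapper (expensive, done once).
        vpiImageCreateWrapper(&data, nullptr,
                              VPI_BACKEND_VIC | VPI_RESTRICT_MEM_USAGE,
                              &vpi_image);
    } else {
        // Subsequent frames: just point the wrapper at the new buffer.
        vpiImageSetWrapper(vpi_image, &data);
    }

    // ... submit VIC/OFA work using vpi_image, then release the buffer ...
}
```

Note vpiImageSetWrapper requires the new buffer to have the same format and dimensions as the one the wrapper was created with, which holds here since the camera cycles a fixed pool of buffers.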

Hi,

This should depend on the use case.

Since cameras usually reuse the same buffer, in some cases (e.g. filtering) copying the data to another buffer is preferred.
But if your use case is a read-only process, wrapping the VPI image should be optimal.

Thanks

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.