VPI + DeepStream is slower than expected (only wrapping)

darekdev · November 12, 2022, 7:26pm

• Hardware Platform (Jetson Nano)
• DeepStream Version 6.0.0
• JetPack Version (4.6)
• Issue Type( questions)

I have built an OpenCV prototype in Python on my workstation, and now I want to port the algorithm to DeepStream on Jetson using VPI.

I use this patch as a reference and want to wrap DS frames for use in a VPI algorithm: Deepstream SDK + VPI on Jetson tx2 - #21 by AastaLLL

It works, but not at full speed. As a first step I only want to do the wrapping: NvBufSurface → EGLImage → CUDA interop → VPI wrapper:


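    // Make sure pending CUDA work on the current context has finished.
    cuCtxSynchronize();

    // Describe the mapped frame to VPI, plane by plane.
    memset(&data, 0, sizeof(data));
    data.format = VPI_IMAGE_FORMAT_RGBA8;
    data.numPlanes = surface->surfaceList[0].planeParams.num_planes;
    for (int i = 0; i < data.numPlanes; i++) {
        data.planes[i].width = surface->surfaceList[0].planeParams.width[i];
        data.planes[i].height = surface->surfaceList[0].planeParams.height[i];
        data.planes[i].pitchBytes = surface->surfaceList[0].planeParams.pitch[i];
        data.planes[i].data = egl_frame.frame.pPitch[i];  // CUDA pointer from the mapped EGLImage
    }

    // Wrap the CUDA memory in a VPI image handle (flags = 0), then sync the stream.
    CHECK_VPI_STATUS(vpiImageCreateCUDAMemWrapper(&data, 0, &img));
    CHECK_VPI_STATUS(vpiStreamSync(ds_tracking_manager->vpi_stream));

    // The handle is destroyed again right away, so it is created anew for every frame.
    vpiImageDestroy(img);

    cuCtxSynchronize();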

Without this code I get 60 fps as expected, but with this code from vpi_wrap.patch I get only 30-45 fps.
What can cause this drop? The documentation says there is no copy (only headers are copied), but the fps drops too much for a path that does no processing.
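
One way to narrow down a drop like this is to time each call in the per-frame path and compare it with the frame budget (about 16.7 ms at 60 fps). The sketch below is a generic helper that uses only the C++ standard library; `mean_call_ms` and `frame_budget_ms` are hypothetical names, not part of VPI or DeepStream. It measures host-side wall-clock time only, so any asynchronous GPU work would need a synchronize call inside the timed callable.

```cpp
#include <chrono>

// Hypothetical helper: runs `fn` `iterations` times and returns the mean
// wall-clock time per call in milliseconds (host side only).
template <typename Fn>
double mean_call_ms(Fn&& fn, int iterations) {
    using clock = std::chrono::steady_clock;
    const auto start = clock::now();
    for (int i = 0; i < iterations; ++i) {
        fn();
    }
    const std::chrono::duration<double, std::milli> elapsed = clock::now() - start;
    return elapsed.count() / iterations;
}

// Hypothetical helper: time available per frame, in milliseconds, at a target rate.
constexpr double frame_budget_ms(double fps) {
    return 1000.0 / fps;
}
```

Wrapping the create call, the stream sync, and the destroy call in separate lambdas and comparing each result with `frame_budget_ms(60.0)` should show which of them uses up the budget.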

Best regards,
Darek

AastaLLL · November 14, 2022, 2:48am

Hi,

Please make sure you have maximized the device performance first.
Can the pipeline reach 60 fps without the VPI wrapping?

Thanks.

darekdev · November 14, 2022, 7:10am

I found what causes the slowdown:

    CHECK_VPI_STATUS(vpiImageCreateCUDAMemWrapper(&data, 0, &img));

I haven’t checked yet whether it is really the same image (NvBufSurface == VPIImage), but I changed the algorithm to this:

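    if (ds_manager->img == nullptr) {
        // First frame: create the wrapper handle once.
        CHECK_VPI_STATUS(vpiImageCreateCUDAMemWrapper(&data, 0, &ds_manager->img));
    } else {
        // Later frames: point the existing wrapper at the new buffer.
        CHECK_VPI_STATUS(vpiImageSetWrappedCUDAMem(ds_manager->img, &data));
    }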

What was not clear to me at first is this function:
vpiImageCreateCUDAMemWrapper

I thought it was only a wrapper, but underneath this function creates a new image:
[out] img Pointer to memory that will receive the created image handle.

Do I understand correctly that this function:
vpiImageSetWrappedCUDAMem
is the one that really only wraps? Is creating the underlying image (a new one each time) the expensive part, and copying not so much?

PS.
How should I draw lines/rectangles/circles on images? NvOSD or CUDA?

Best regards,
Darek

AastaLLL · November 15, 2022, 3:15am

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one.
Thanks

Hi,

Wrapping does not create the buffer, only the wrapper handle.
vpiImageSetWrappedCUDAMem redefines an existing wrapper so that it points to different memory.

For your use case, you can create the wrapper once at initialization.
Then, at runtime, redefine the pointer with vpiImageSetWrappedCUDAMem whenever the buffer pointer changes.
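
The create-once, rebind-at-runtime lifecycle described above could be sketched roughly as follows. To keep it compilable without the VPI headers, the sketch uses stand-ins: `FakeImage`, `WrapperStats`, and `CachedWrapper` are hypothetical names and not part of VPI. In a real plugin the three marked spots would presumably call vpiImageCreateCUDAMemWrapper, vpiImageSetWrappedCUDAMem, and vpiImageDestroy.

```cpp
// Hypothetical stand-in for a VPI image handle; it only remembers the wrapped pointer.
struct FakeImage {
    const void* data;
};

// Counts how often each lifecycle step ran, so the pattern can be checked.
struct WrapperStats {
    int creates = 0;
    int rebinds = 0;
    int destroys = 0;
};

class CachedWrapper {
public:
    explicit CachedWrapper(WrapperStats& stats) : stats_(stats) {}
    ~CachedWrapper() { release(); }
    CachedWrapper(const CachedWrapper&) = delete;
    CachedWrapper& operator=(const CachedWrapper&) = delete;

    // Call once per frame with the current buffer pointer.
    FakeImage* wrap(const void* frame_ptr) {
        if (img_ == nullptr) {
            // First frame: create the handle (stand-in for the create call).
            img_ = new FakeImage{frame_ptr};
            ++stats_.creates;
        } else if (img_->data != frame_ptr) {
            // Buffer pointer changed: re-point the existing handle
            // (stand-in for the set-wrapped-memory call).
            img_->data = frame_ptr;
            ++stats_.rebinds;
        }
        return img_;
    }

    // Call at shutdown (stand-in for the destroy call).
    void release() {
        if (img_ != nullptr) {
            delete img_;
            img_ = nullptr;
            ++stats_.destroys;
        }
    }

private:
    WrapperStats& stats_;
    FakeImage* img_ = nullptr;
};
```

With this shape the expensive handle creation happens only on the first frame, and frames that reuse the same buffer pointer do no wrapper work at all.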

Thanks.

system · December 19, 2022, 6:05am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
