Slow memory copy from Device to Host with NvBufSurfaceMap API

v.hunglx2 · August 17, 2021, 4:50am

Hi, I am trying to use NvBufSurfaceMap to transform a DeepStream surface buffer on the Jetson platform (Xavier AGX), and I find that it performs very poorly (2 GiB/s), apparently because the mappedAddr.addr of the mapped surface is pageable memory. Can anyone from NVIDIA confirm this and suggest a solution?
Thanks

DaneLLL · August 17, 2021, 6:54am

Hi,
If you allocate a CPU buffer and copy into it by calling memcpy(), the performance can be capped by the CPU. We suggest creating another NvBufSurface so that you can call NvBufSurfTransform() to copy the data into it. That call uses the hardware VIC engine and is fast. You can then call NvBufSurfaceMap() on the destination surface to get a CPU-accessible pointer.
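
To check how much of the limit comes from the CPU copy itself, one option is to time memcpy() between two ordinary host buffers of roughly frame size, then compare the result with the figure measured on the mapped pointer. The following is only a minimal sketch using the C++ standard library. The helper name memcpy_gib_per_s and the 1920x1080 RGBA frame size mentioned below are assumptions for illustration, not part of the DeepStream API.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical helper (not a DeepStream call): average throughput of
// memcpy() between two plain host buffers, in GiB/s.
double memcpy_gib_per_s(std::size_t bytes, int iterations) {
    assert(bytes > 0 && iterations > 0);

    std::vector<unsigned char> src(bytes, 0x5A);
    std::vector<unsigned char> dst(bytes, 0);

    // Warm-up copy so page faults on first touch are not timed.
    std::memcpy(dst.data(), src.data(), bytes);

    volatile unsigned char sink = 0;  // keeps the copies from being optimized away
    const auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < iterations; ++i) {
        src[0] = static_cast<unsigned char>(i);  // make each copy distinct
        std::memcpy(dst.data(), src.data(), bytes);
        sink = dst[0];
    }
    const auto end = std::chrono::steady_clock::now();
    (void)sink;

    const double seconds = std::chrono::duration<double>(end - start).count();
    const double gib = static_cast<double>(bytes) * iterations /
                       (1024.0 * 1024.0 * 1024.0);
    return gib / seconds;
}
```

For example, memcpy_gib_per_s(1920 * 1080 * 4, 100) would time one hundred copies of an assumed RGBA 1080p frame. If the host-to-host number is close to the mapped-pointer number, the CPU copy is the likely cap. If it is much higher, the mapped memory itself is the more likely cause.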
