Poor read/write performance to/from mapped memory on Xavier AGX

RokiaDiarr · April 23, 2021, 1:53pm

Hi,

I have a Xavier AGX connected to an x86 host PC over PCIe, using the external PCIe x16 slot of the Xavier AGX. The Xavier AGX is configured as an endpoint device (NVIDIA RAM Memory).

In the endpoint function driver (‘pci-epf-nv-test.c’, located in the kernel/nvidia/drivers/pci/endpoint/functions directory of the L4T kernel source), the necessary code has been added to allocate 256MB of DMA memory with dma_alloc_coherent.

The details of the memory allocated with dma_alloc_coherent are exported and used by a second PCIe driver. In this second driver, dma_mmap_coherent is used to map the memory allocated by the ‘pci-epf-nv-test.c’ driver. A user application can then access this memory by calling mmap on the character device created by the second driver.

Everything works: a user application can read from and write to this mmapped memory. However, copying 256MB from the mmapped memory to a local buffer (allocated with malloc, for example) is very slow (around 78MB/s). Writing from a local buffer (allocated with malloc) to the mmapped memory is also slow (around 1.5GB/s). For comparison, copying between two local buffers runs at around 6GB/s.

How can I improve read and write performance on the mmapped memory?
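For reference, a measurement of this kind can be sketched as below. This is only a minimal sketch, not the actual test application: the helper names `map_device` and `copy_mb_per_s` are hypothetical, the device path is whatever character device the second driver creates, and 1 MB is assumed to mean 10^6 bytes.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <string.h>
#include <sys/mman.h>
#include <time.h>
#include <unistd.h>

/* Hypothetical helper: map `len` bytes of the character device at `path`.
 * Returns NULL if the device cannot be opened or mapped. */
void *map_device(const char *path, size_t len)
{
    int fd = open(path, O_RDWR | O_SYNC);
    if (fd < 0)
        return NULL;

    void *p = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    close(fd); /* the mapping remains valid after the descriptor is closed */
    return p == MAP_FAILED ? NULL : p;
}

/* Hypothetical helper: copy `len` bytes from `src` to `dst` with memcpy
 * and return the observed throughput in MB/s (1 MB = 10^6 bytes). */
double copy_mb_per_s(void *dst, const void *src, size_t len)
{
    struct timespec t0, t1;

    assert(len > 0);
    clock_gettime(CLOCK_MONOTONIC, &t0);
    memcpy(dst, src, len);
    clock_gettime(CLOCK_MONOTONIC, &t1);

    double secs = (double)(t1.tv_sec - t0.tv_sec)
                + (double)(t1.tv_nsec - t0.tv_nsec) / 1e9;
    if (secs <= 0.0)
        secs = 1e-9; /* guard against a zero interval on very small copies */
    return (double)len / 1e6 / secs;
}
```

With the pointer returned by `map_device` and a 256MB malloc buffer, calling `copy_mb_per_s(local, mapped, len)` gives the read figure and `copy_mb_per_s(mapped, local, len)` gives the write figure. A single timed memcpy is noisy, so averaging several runs is probably a good idea.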

Thanks !

RokiaDiarr · April 27, 2021, 9:39am

Could anyone help, please? Any suggestions? Thanks.

omp · April 29, 2021, 4:10pm

Can you share your modified driver files and how you are measuring throughput?

RokiaDiarr · April 30, 2021, 6:54am

Hi, I can share our modified driver files and our current test application with you privately. How can I send you these files privately? Thanks

TomNVIDIA · April 30, 2021, 3:33pm

Hi @RokiaDiarr, You can send files through a private message. Click on the member avatar and hit the “message” button.

Best,
Tom

RokiaDiarr · May 3, 2021, 9:00am

Hi @TomNVIDIA , @omp

Sorry for the late reply, it was the weekend here. I sent you our modified driver files a few minutes ago. Thanks

RokiaDiarr · May 5, 2021, 9:27am

Hi @TomNVIDIA , @omp

Have you received the files I sent you?

TomNVIDIA · May 5, 2021, 5:30pm

Hi @RokiaDiarr,

Unfortunately I am not a technical resource, @omp will need to look at the file.

Best,
Tom

RokiaDiarr · May 12, 2021, 9:21am

Hi @omp,

Any update? Thanks

omp · May 12, 2021, 11:44am

Sorry for the delay in reply…
We are discussing this issue with our Memory team

vidyas · May 12, 2021, 12:30pm

Can you please try using dma_alloc_writecombined() in place of dma_alloc_coherent() and perform an explicit dsb() before the data is accessed by the userspace code?

RokiaDiarr · May 17, 2021, 7:32am

Apologies for the delay, we have just returned from a long weekend.
I’m not sure I understand: should I replace dma_alloc_coherent() with dma_alloc_writecombined()?
And how can I perform an explicit dsb()? (In which file is dsb() defined?) I searched on Google but did not find a clear answer. Thanks

vidyas · May 18, 2021, 3:52pm

dsb() is defined in the arch/arm64/include/asm/barrier.h file.

puneets · May 26, 2021, 12:28pm

Using dma_alloc_writecombined with a subsequent call to dsb(sy) will improve write performance only.

Another option:
In the PCI device’s DT node, add the ‘dma-coherent’ property. As Xavier is an IO-coherent SoC, no cache operations are needed if this property is set in the DT node.
Did you enable the IOMMU for the PCI device?
