Problems encountered in testing PCIE EP/RP network functions

NVIDIA工程师,你好:
我在调试PCIE C5 EP/RP功能时遇到了一个问题,问题描述:
customized version of the card
orinA: jetpack R35.4.1 RootPoint
orinB: jetpack R35.4.1 EndPoint
按照NVIDIA手册使能PCIE EndPoint,
https://docs.nvidia.com/jetson/archives/r35.4.1/DeveloperGuide/text/SD/Communications/PcieEndpointMode.html#bringing-up-an-ethernet-interface-over-pcie
EndPoint:
cd /sys/kernel/config/pci_ep/
mkdir functions/pci_epf_tvnet/func1
echo 16 > functions/pci_epf_tvnet/func1/msi_interrupts
ln -s functions/pci_epf_tvnet/func1 controllers/141a0000.pcie_ep/
echo 1 > controllers/141a0000.pcie_ep/start

rootPoint:
sudo reboot

On the endpoint device: ifconfig eth1 up
On the root port system: ifconfig eth1 up
On the endpoint device: ifconfig eth1 192.168.2.1
On the root port system: ifconfig eth1 192.168.2.2

EndPoint:
root@EAORA07B:/sys/kernel/config/pci_ep# ifconfig eth1 192.168.2.1
root@EAORA07B:/sys/kernel/config/pci_ep# ping 192.168.2.2
PING 192.168.2.2 (192.168.2.2) 56(84) bytes of data.
64 bytes from 192.168.2.2: icmp_seq=1 ttl=64 time=3.49 ms
64 bytes from 192.168.2.2: icmp_seq=2 ttl=64 time=1.14 ms
64 bytes from 192.168.2.2: icmp_seq=3 ttl=64 time=1.12 ms
64 bytes from 192.168.2.2: icmp_seq=4 ttl=64 time=1.19 ms
64 bytes from 192.168.2.2: icmp_seq=5 ttl=64 time=0.991 ms
64 bytes from 192.168.2.2: icmp_seq=6 ttl=64 time=0.956 ms
64 bytes from 192.168.2.2: icmp_seq=7 ttl=64 time=1.01 ms
64 bytes from 192.168.2.2: icmp_seq=8 ttl=64 time=1.07 ms
64 bytes from 192.168.2.2: icmp_seq=9 ttl=64 time=0.872 ms
64 bytes from 192.168.2.2: icmp_seq=10 ttl=64 time=1.01 ms
64 bytes from 192.168.2.2: icmp_seq=11 ttl=64 time=1.00 ms
64 bytes from 192.168.2.2: icmp_seq=12 ttl=64 time=0.979 ms
64 bytes from 192.168.2.2: icmp_seq=13 ttl=64 time=0.986 ms
64 bytes from 192.168.2.2: icmp_seq=14 ttl=64 time=0.980 ms
64 bytes from 192.168.2.2: icmp_seq=15 ttl=64 time=1.17 ms
64 bytes from 192.168.2.2: icmp_seq=16 ttl=64 time=1.16 ms
64 bytes from 192.168.2.2: icmp_seq=17 ttl=64 time=1.08 ms
64 bytes from 192.168.2.2: icmp_seq=18 ttl=64 time=1.02 ms
64 bytes from 192.168.2.2: icmp_seq=19 ttl=64 time=1.15 ms
64 bytes from 192.168.2.2: icmp_seq=20 ttl=64 time=1.13 ms
64 bytes from 192.168.2.2: icmp_seq=21 ttl=64 time=0.983 ms
64 bytes from 192.168.2.2: icmp_seq=22 ttl=64 time=0.785 ms
[ 457.649936] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 457.655324] **************************************
[ 457.660257] CPU:0, Error:cbb-fabric, Errmon:2
[ 457.664747] Error Code : PWRDOWN_ERR
[ 457.668785] Overflow : Multiple PWRDOWN_ERR
[ 457.673455]
[ 457.674985] Error Code : PWRDOWN_ERR
[ 457.679017] MASTER_ID : CCPLEX
[ 457.682518] Address : 0x3a060010
[ 457.686202] Cache : 0x1 – Bufferable
[ 457.690509] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 457.697502] Access_Type : Write
[ 457.701099] Access_ID : 0x1
[ 457.701101] Fabric : cbb-fabric
[ 457.707924] Slave_Id : 0x16
[ 457.711155] Burst_length : 0x0
[ 457.714654] Burst_type : 0x1
[ 457.717979] Beat_size : 0x0
[ 457.721210] VQC : 0x0
[ 457.723992] GRPSEC : 0x7e
[ 457.727051] FALCONSEC : 0x0
[ 457.730284] **************************************
[ 457.735342] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 457.745774] —[ end trace c501c38c0ed7f426 ]—
[ 457.750577] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 457.755950] **************************************
[ 457.760874] CPU:0, Error:cbb-fabric, Errmon:2
[ 457.765361] Error Code : PWRDOWN_ERR
[ 457.769396] Overflow : Multiple PWRDOWN_ERR
[ 457.774052]
[ 457.775581] Error Code : PWRDOWN_ERR
[ 457.779616] MASTER_ID : CCPLEX
[ 457.783099] Address : 0x3a06004c
[ 457.786777] Cache : 0x1 – Bufferable
[ 457.791079] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 457.798044] Access_Type : Read
[ 457.801542] Access_ID : 0x11
[ 457.801543] Fabric : cbb-fabric
[ 457.808457] Slave_Id : 0x16
[ 457.811687] Burst_length : 0x0
[ 457.815184] Burst_type : 0x1
[ 457.818507] Beat_size : 0x2
[ 457.821738] VQC : 0x0
[ 457.824529] GRPSEC : 0x7e
[ 457.827586] FALCONSEC : 0x0
[ 457.830818] **************************************
[ 457.835840] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 457.845990] —[ end trace c501c38c0ed7f427 ]—
[ 457.850789] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 457.856163] **************************************
[ 457.861087] CPU:0, Error:cbb-fabric, Errmon:2
[ 457.865574] Error Code : PWRDOWN_ERR
[ 457.869605] Overflow : Multiple PWRDOWN_ERR
[ 457.874270]
[ 457.875797] Error Code : PWRDOWN_ERR
[ 457.879824] MASTER_ID : CCPLEX
[ 457.883318] Address : 0x3a06004c
[ 457.886992] Cache : 0x1 – Bufferable
[ 457.891294] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 457.898258] Access_Type : Read
[ 457.901753] Access_ID : 0x15
[ 457.901754] Fabric : cbb-fabric
[ 457.908667] Slave_Id : 0x16
[ 457.911897] Burst_length : 0x0
[ 457.915394] Burst_type : 0x1
[ 457.918716] Beat_size : 0x2
[ 457.921947] VQC : 0x0
[ 457.924738] GRPSEC : 0x7e
[ 457.927794] FALCONSEC : 0x0
[ 457.931026] **************************************
[ 457.936053] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 457.946200] —[ end trace c501c38c0ed7f428 ]—
[ 457.950970] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 457.956343] **************************************
[ 457.961273] CPU:0, Error:cbb-fabric, Errmon:2
[ 457.965749] Error Code : PWRDOWN_ERR
[ 457.969780] Overflow : Multiple PWRDOWN_ERR
[ 457.974447]
[ 457.975979] Error Code : PWRDOWN_ERR
[ 457.980014] MASTER_ID : CCPLEX
[ 457.983510] Address : 0x3a06004c
[ 457.987188] Cache : 0x1 – Bufferable
[ 457.991490] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 457.998474] Access_Type : Read
[ 458.001971] Access_ID : 0x10
[ 458.001972] Fabric : cbb-fabric
[ 458.008886] Slave_Id : 0x16
[ 458.012114] Burst_length : 0x0
[ 458.015611] Burst_type : 0x1
[ 458.018934] Beat_size : 0x2
[ 458.022165] VQC : 0x0
[ 458.024955] GRPSEC : 0x7e
[ 458.028009] FALCONSEC : 0x0
[ 458.031241] **************************************
[ 458.036262] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 458.046401] —[ end trace c501c38c0ed7f429 ]—
[ 458.051192] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 458.056559] **************************************
[ 458.061482] CPU:0, Error:cbb-fabric, Errmon:2
[ 458.065969] Error Code : PWRDOWN_ERR
[ 458.070004] Overflow : Multiple PWRDOWN_ERR
[ 458.074672]
[ 458.076198] Error Code : PWRDOWN_ERR
[ 458.080229] MASTER_ID : CCPLEX
[ 458.083711] Address : 0x3a06004c
[ 458.087388] Cache : 0x1 – Bufferable
[ 458.091690] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 458.098655] Access_Type : Read
[ 458.102152] Access_ID : 0x16
[ 458.102153] Fabric : cbb-fabric
[ 458.109066] Slave_Id : 0x16
[ 458.112297] Burst_length : 0x0
[ 458.115791] Burst_type : 0x1
[ 458.119114] Beat_size : 0x2
[ 458.122344] VQC : 0x0
[ 458.125133] GRPSEC : 0x7e
[ 458.128189] FALCONSEC : 0x0
[ 458.131420] **************************************
[ 458.136445] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 458.146583] —[ end trace c501c38c0ed7f42a ]—
[ 458.151352] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 458.156724] **************************************
[ 458.161644] CPU:0, Error:cbb-fabric, Errmon:2
[ 458.166127] Error Code : PWRDOWN_ERR
[ 458.170158] Overflow : Multiple PWRDOWN_ERR
[ 458.174826]
[ 458.176355] Error Code : PWRDOWN_ERR
[ 458.180390] MASTER_ID : CCPLEX
[ 458.183887] Address : 0x3a06004c
[ 458.187565] Cache : 0x1 – Bufferable
[ 458.191866] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 458.198830] Access_Type : Read
[ 458.202328] Access_ID : 0x12
[ 458.202329] Fabric : cbb-fabric
[ 458.209241] Slave_Id : 0x16
[ 458.212469] Burst_length : 0x0
[ 458.215965] Burst_type : 0x1
[ 458.219288] Beat_size : 0x2
[ 458.222518] VQC : 0x0
[ 458.225309] GRPSEC : 0x7e
[ 458.228366] FALCONSEC : 0x0
[ 458.231598] **************************************
[ 458.236617] WARNING: CPU: 0 PID: 0 at drivers/soc/tegra/cbb/tegra234-cbb.c:577 tegra234_cbb_isr+0x130/0x170
[ 458.246755] —[ end trace c501c38c0ed7f42b ]—
[ 458.251521] CPU:0, Error: cbb-fabric@0x13a00000, irq=34
[ 458.256893] **************************************
[ 458.261817] CPU:0, Error:cbb-fabric, Errmon:2
[ 458.266300] Error Code : PWRDOWN_ERR
[ 458.270335] Overflow : Multiple PWRDOWN_ERR
[ 458.275003]
[ 458.276533] Error Code : PWRDOWN_ERR
[ 458.280568] MASTER_ID : CCPLEX
[ 458.284049] Address : 0x3a06004c
[ 458.287727] Cache : 0x1 – Bufferable
[ 458.292028] Protection : 0x2 – Unprivileged, Non-Secure, Data Access
[ 458.299003] Access_Type : Read
[ 458.302500] Access_ID : 0x16
[ 458.302500] Fabric : cbb-fabric
[ 458.309414] Slave_Id : 0x16
[ 458.312644] Burst_length : 0x0
[ 458.316139] Burst_type : 0x1
[ 458.319461] Beat_size : 0x2
[ 458.322692] VQC : 0x0
[ 458.325482] GRPSEC : 0x7e
[ 458.328539] FALCONSEC : 0x0
[ 458.331770] **************************************

问题在这里:

  1. 在ping的过程中我没有使用Ctrl C停止EndPoint的ping 动作,然后直接sudo reboot RootPoint,结果EndPoint的内核直接崩溃。只能重新上电恢复。

  2. 重复测试,使用ctrl C终止掉EndPoint的ping命令后,在sudo reboot RootPoint不会导致EndPoint Orin 内核崩溃。

问题1可以100%出现。

請問這個問題有多容易複製到?

在我进行测试的custom CVB上是100%复现。

没有在devkit上测试过。

There is no update from you for a period, assuming this is not an issue any more.
Hence we are closing this topic. If need further support, please open a new one.
Thanks

That seems to be custom board issue.

Is this still an issue to support? Any result can be shared?