omp,
I have done some testing below.
- burn both EP and RC with original JetPack4.3 packet, and it works for eth1 iperf3.
- replace both EP and RC file /boot/Image with JetPack4.4 packet version which is under kernel/Image
- do the same testing it works well and can bootup correctly.
- then I update both EP and RC dtb with command “sudo ./flash.sh -k kernel-dtb jetson-xavier mmcblk0p1” under JetPacket 4.4
- bootup both EP and RC then I see de PCIe Bus Err showing.
Seems some changing in dtb will cause PCIe Bus Err showing, can you reproduce on your side?
What is the key dts changing between JetPack 4.3 and 4.4?
I still found anther two issues:
- replace JetPack 4.4 Image version don’t see any improvement on iperf3 tcp testing still 100M band.
- if do iperf3 udp testing, there is coredump.
pcie-rc:~$ sudo ifconfig eth1 192.168.2.1;iperf3 -s
-----------------------------------------------------------
Server listening on 5201
-----------------------------------------------------------
Accepted connection from 192.168.2.2, port 48820
[ 5] local 192.168.2.1 port 5201 connected to 192.168.2.2 port 48822
[ ID] Interval Transfer Bandwidth
[ 5] 0.00-1.00 sec 8.08 MBytes 67.8 Mbits/sec
[ 5] 1.00-2.00 sec 12.0 MBytes 100 Mbits/sec
[ 5] 2.00-3.00 sec 12.2 MBytes 102 Mbits/sec
[ 5] 3.00-4.00 sec 12.3 MBytes 103 Mbits/sec
[ 5] 4.00-5.00 sec 11.6 MBytes 97.1 Mbits/sec
[ 5] 5.00-6.00 sec 11.8 MBytes 98.8 Mbits/sec
[ 5] 6.00-7.00 sec 12.3 MBytes 103 Mbits/sec
[ 5] 7.00-8.00 sec 12.4 MBytes 104 Mbits/sec
[ 5] 8.00-9.00 sec 11.4 MBytes 95.3 Mbits/sec
[ 5] 9.00-10.00 sec 12.1 MBytes 102 Mbits/sec
[ 5] 10.00-10.05 sec 608 KBytes 99.0 Mbits/sec
- - - - - - - - - - - - - - - - - - - - - - - - -
[ ID] Interval Transfer Bandwidth
[ 5] 0.00-10.05 sec 0.00 Bytes 0.00 bits/sec sender
[ 5] 0.00-10.05 sec 117 MBytes 97.4 Mbits/sec receiver
-----------------------------------------------------------
Server listening on 5201
-----------------------------------------------------------
Accepted connection from 192.168.2.2, port 48824
[ 5] local 192.168.2.1 port 5201 connected to 192.168.2.2 port 48986
[ ID] Interval Transfer Bandwidth Jitter Lost/Total Datagrams
[ 5] 0.00-1.00 sec 0.00 Bytes 0.00 bits/sec 0.000 ms 0/0 (0%)
[ 139.748695] tegradc 15200000.nvdisplay: blank - powerdown
[ 139.793566] extcon-disp-state external-connection:disp-state: cable 48 state 0
[ 139.793573] Extcon AUX2(HDMI) disable
[ 139.820022] tegra_nvdisp_handle_pd_disable: Powergated Head1 pd
[ 139.822376] tegra_nvdisp_handle_pd_disable: Powergated Head0 pd
[ 139.822893] tegradc 15200000.nvdisplay: unblank
[ 139.839843] tegra_nvdisp_handle_pd_enable: Unpowergated Head0 pd
[ 139.840088] tegra_nvdisp_handle_pd_enable: Unpowergated Head1 pd
[ 139.852734] Parent Clock set for DC plld3
[ 139.857654] tegradc 15200000.nvdisplay: hdmi: tmds rate:148500K prod-setting:prod_c_hdmi_111m_223m
[ 139.859367] tegradc 15200000.nvdisplay: hdmi: get RGB quant from EDID.
[ 139.859384] tegradc 15200000.nvdisplay: hdmi: get YCC quant from EDID.
[ 139.901023] extcon-disp-state external-connection:disp-state: cable 48 state 1
[ 139.901032] Extcon AUX2(HDMI) enable
[ 140.495865] skbuff: skb_over_panic: text:ffffff8008938f84 len:8234 put:8234 head:ffffffc3ba04a600 data:ffffffc3ba04a640 tail:0x206a end:0x640 dev:eth1
[ 140.496851] ------------[ cut here ]------------
[ 140.496940] kernel BUG at /home/nvidia/nvidia_sdk/JetPack_4.3_Linux_P2888/Linux_for_Tegra/sources/kernel/kernel-4.9/net/core/skbuff.c:105!
[ 140.496987] Internal error: Oops - BUG: 0 [#1] PREEMPT SMP
[ 140.497055] Modules linked in:
[ 140.497330] CPU: 0 PID: 1177 Comm: kworker/0:2 Not tainted 4.9.140+ #2
[ 140.497358] Hardware name: Jetson-AGX (DT)
[ 140.497502] Workqueue: events process_ep2h_msg
[ 140.497547] task: ffffffc3e985aa00 task.stack: ffffffc3e98b8000
[ 140.497653] PC is at skb_panic+0x6c/0x70
[ 140.497709] LR is at skb_panic+0x6c/0x70
[ 140.497767] pc : [<ffffff8008d6e674>] lr : [<ffffff8008d6e674>] pstate: 20c00045
[ 140.497798] sp : ffffffc3e98bbc90
[ 140.497901] x29: ffffffc3e98bbca0 x28: ffffffc3d6730c40
[ 140.498036] x27: ffffffc3d821a9d0 x26: ffffff8023005018
[ 140.498134] x25: ffffff8023003000 x24: ffffffc3d821a000
[ 140.498234] x23: ffffffc3c8b63d00 x22: 000000000000202a
[ 140.498332] x21: ffffff8008938f84 x20: ffffff8009145af0
[ 140.498632] x19: ffffffc3c8b63d00 x18: 0000000000000060
[ 140.498732] x17: 000000000000c5c5 x16: 00000950fffff6af
[ 140.498826] x15: 00000000fffffffd x14: 6233636666666666
[ 140.498918] x13: 663a646165682034 x12: 3332383a74757020
[ 140.499012] x11: 343332383a6e656c x10: 2034386638333938
[ 140.499195] x9 : 3030386666666666 x8 : ffffffc3ffc318a0
[ 140.499290] x7 : 0000000000000000 x6 : 0000000016d3d2b6
[ 140.499378] x5 : 0000000000000000 x4 : 0000000000000000
[ 140.499473] x3 : ffffffffffffffff x2 : 00000043f645a000
[ 140.499568] x1 : ffffffc3e985aa00 x0 : 000000000000008a
[ 140.499592]
[ 140.499751] Process kworker/0:2 (pid: 1177, stack limit = 0xffffffc3e98b8000)
[ 140.499779] Call trace:
[ 140.499857] [<ffffff8008d6e674>] skb_panic+0x6c/0x70
[ 140.499918] [<ffffff8008d7099c>] skb_put+0x8c/0x90
[ 140.499978] [<ffffff8008938f84>] process_ep2h_msg+0x13c/0x198
[ 140.500046] [<ffffff80080d4e58>] process_one_work+0x1e8/0x490
[ 140.500112] [<ffffff80080d5158>] worker_thread+0x58/0x4c0
[ 140.500436] [<ffffff80080dbb28>] kthread+0xd8/0xf0
[ 140.500501] [<ffffff8008083850>] ret_from_fork+0x10/0x40
[ 140.598416] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xfff8b000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=457618003, pte=0
[ 140.599345] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xfff86000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=457618003, pte=0
[ 140.599979] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xfff66000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=457618003, pte=0
[ 140.600722] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xfff5d000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=457618003, pte=0
[ 140.601399] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffda1000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.601916] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffda1000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.603299] mc-err: vpr base=0:0, size=0, ctrl=3, override:(a01a8340, fcee10c1, 1, 0)
[ 140.603370] mc-err: (255) csw_pcie5w: MC request violates VPR requirements
[ 140.603459] mc-err: status = 0x0ff740e3; addr = 0xffffffff00; hi_adr_reg=008
[ 140.603491] mc-err: secure: yes, access-type: write
[ 140.603855] mc-err: mcerr: unknown intr source intstatus = 0x00000000, intstatus_1 = 0x00000000
[ 140.604783] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffd84000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.605231] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffd62000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.606317] mc-err: vpr base=0:0, size=0, ctrl=3, override:(a01a8340, fcee10c1, 1, 0)
[ 140.606421] mc-err: (255) csw_pcie5w: MC request violates VPR requirements
[ 140.606476] mc-err: status = 0x0ff740e3; addr = 0xffffffff00; hi_adr_reg=008
[ 140.606505] mc-err: secure: yes, access-type: write
[ 140.606592] mc-err: mcerr: unknown intr source intstatus = 0x00000000, intstatus_1 = 0x00000000
[ 140.610315] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffd58000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.610807] t19x-arm-smmu 12000000.iommu: Unhandled context fault: smmu0, iova=0xffd54000, fsynr=0xc0013, cb=3, sid=91(0x5b - PCIE5), pgd=4575eb003, pud=4575eb003, pmd=443da2003, pte=0
[ 140.614934] mc-err: Too many MC errors; throttling prints
[ 141.518931] ---[ end trace 01525e739c6eb4e8 ]---
[ 141.544974] Kernel panic - not syncing: Fatal exception in interrupt
[ 141.544980] SMP: stopping secondary CPUs
[ 141.544999] Kernel Offset: disabled
[ 141.545001] Memory Limit: none
[ 142.014180] trusty-log panic notifier - trusty version Built: 22:43:40 Dec 9 2019
[ 142.014180] Rebooting in 5 seconds..
pcie-ep:~$ sudo ifconfig eth1 192.168.2.2;iperf3 -c 192.168.2.1 -b 1000M
Connecting to host 192.168.2.1, port 5201
[ 4] local 192.168.2.2 port 48822 connected to 192.168.2.1 port 5201
[ ID] Interval Transfer Bandwidth Retr Cwnd
[ 4] 0.00-1.00 sec 9.52 MBytes 79.9 Mbits/sec 0 376 KBytes
[ 4] 1.00-2.00 sec 12.1 MBytes 101 Mbits/sec 0 510 KBytes
[ 4] 2.00-3.00 sec 12.5 MBytes 105 Mbits/sec 0 529 KBytes
[ 4] 3.00-4.00 sec 12.1 MBytes 102 Mbits/sec 0 556 KBytes
[ 4] 4.00-5.00 sec 11.9 MBytes 99.8 Mbits/sec 0 556 KBytes
[ 4] 5.00-6.00 sec 11.6 MBytes 97.5 Mbits/sec 0 585 KBytes
[ 4] 6.00-7.00 sec 12.4 MBytes 104 Mbits/sec 0 585 KBytes
[ 4] 7.00-8.00 sec 12.1 MBytes 102 Mbits/sec 0 585 KBytes
[ 4] 8.00-9.00 sec 11.6 MBytes 97.5 Mbits/sec 0 585 KBytes
[ 4] 9.00-10.00 sec 12.4 MBytes 104 Mbits/sec 0 585 KBytes
- - - - - - - - - - - - - - - - - - - - - - - - -
[ ID] Interval Transfer Bandwidth Retr
[ 4] 0.00-10.00 sec 118 MBytes 99.2 Mbits/sec 0 sender
[ 4] 0.00-10.00 sec 117 MBytes 97.9 Mbits/sec receiver
iperf Done.
pcie-ep:~$ sudo ifconfig eth1 192.168.2.2;iperf3 -u -c 192.168.2.1 -b 1000M
Connecting to host 192.168.2.1, port 5201
[ 4] local 192.168.2.2 port 48986 connected to 192.168.2.1 port 5201