we have a custom PCIe switch based on PEX8613. It has been working ok with R23.2.0. However, contrary to release note for R24 which states certain PCIe hanging having been resolved, we continue to see 9 out of 10 reboots of TX1 hanging during kernel boot. uboot still is good. we can query pci and it displays this:
Tegra210 (P2371-2180) # pci
Scanning PCI devices on bus 0
BusDevFun VendorId DeviceId Device Class Sub-Class
00.01.00 0x10de 0x0fae Bridge device 0x04
Little bit about PEX8613 with R23 kernel; it shows the following:
00:01.0 PCI bridge: NVIDIA Corporation Device 0fae (rev a1)
01:00.0 PCI bridge: PLX Technology, Inc. PEX 8613 12-lane, 3-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev ba)
02:01.0 Bridge: PLX Technology, Inc. PEX 8613 12-lane, 3-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev ba)
02:02.0 PCI bridge: PLX Technology, Inc. PEX 8613 12-lane, 3-Port PCI Express Gen 2 (5.0 GT/s) Switch (rev ba)
Having said that, R23 has an odd message that shows up occasionally:
ubuntu@tegra-ubuntu:~$ [ 29.294637] pgd = ffffffc0f5260000
[ 29.298036] [0000000c] *pgd=000000015b6f7003, *pmd=000000015bf95003, *pte=0000000000000000
[ 29.306472] Library at 0xf26e1e44: 0xf26c3000 /usr/lib/arm-linux-gnueabihf/qt5/plugins/platforms/libqxcb.so
[ 29.316211] Library at 0x2: 0x8000 /usr/bin/signon-ui
[ 29.321254] vdso base = 0xf72ab000
[ 325.476343] vgaarb: this pci device is not a vga device
Could it be that TX1’s video driver keeps hitting PCIe and doesn’t recognize bridge type device. And in R24 its really broken and causing failures almost 100% of the time?
I can send you full boot logs fro R23 and R24, if that would help.
One last thing, on occasion when R24 does boot up, it doesn’t last for very long. it crashes as soon as lscpi is executed or in most instances, just starts dumping registers and causes CPU0 to hardlock.
TX1-R24-2-PCIe-Failure.txt (4.55 KB)
TX1-R24-2-PCIe-OK-WithErrors.txt (3.68 KB)