We have a custom FPGA card connected to the TK1 platform. Sometimes (about one time in 10), PCIe reports the errors shown below and the card is no longer accessible. We have to reboot the device to recover.
[ 1.169182] msgmni has been set to 1430
[ 1.171852] io scheduler noop registered (default)
[ 1.172412] of_get_named_gpio_flags: can't parse gpios property
[ 1.172445] of_get_named_gpio_flags: can't parse gpios property
[ 1.172474] of_get_named_gpio_flags: can't parse gpios property
[ 1.214012] PCI host bridge to bus 0000:00
[ 1.214047] pci_bus 0000:00: root bus resource [mem 0x32100000-0x3fffffff]
[ 1.214082] pci_bus 0000:00: root bus resource [mem 0x12100000-0x320fffff pref]
[ 1.214119] pci_bus 0000:00: root bus resource [io 0x1000-0xffff]
[ 1.214155] pci_bus 0000:00: No busn resource found for root bus, will use [bus 00-ff]
[ 1.214239] pci 0000:00:00.0: [10de:0e12] type 01 class 0x060400
[ 1.214435] pci 0000:00:00.0: PME# supported from D0 D1 D2 D3hot D3cold
[ 1.214996] PCI: bus0: Fast back to back transfers disabled
[ 1.215033] pci 0000:00:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
[ 1.215402] pci 0000:01:00.0: [1172:e001] type 00 class 0xff0000
[ 1.215484] pci 0000:01:00.0: reg 10: [mem 0x00000000-0x0fffffff 64bit pref]
[ 1.215572] pci 0000:01:00.0: reg 18: [mem 0x00000000-0x00007fff]
[ 1.217633] PCI: bus1: Fast back to back transfers disabled
[ 1.217670] pci_bus 0000:01: busn_res: [bus 01-ff] end is updated to 01
[ 1.217710] pci_bus 0000:00: busn_res: [bus 00-ff] end is updated to 01
[ 1.218252] pcieport 0000:00:00.0: Signaling PME through PCIe PME interrupt
[ 1.218290] pci 0000:01:00.0: Signaling PME through PCIe PME interrupt
[ 1.218327] pcie_pme 0000:00:00.0:pcie01: service driver pcie_pme loaded
[ 1.218606] aer 0000:00:00.0:pcie02: service driver aer loaded
[ 1.218971] PCI host bridge to bus 0000:02
[ 1.219002] pci_bus 0000:02: root bus resource [mem 0x32100000-0x3fffffff]
[ 1.219036] pci_bus 0000:02: root bus resource [mem 0x12100000-0x320fffff pref]
[ 1.219072] pci_bus 0000:02: root bus resource [io 0x10000-0x1ffff]
[ 1.219107] pci_bus 0000:02: No busn resource found for root bus, will use [bus 02-ff]
[ 1.219195] pci 0000:02:00.0: [10de:0e13] type 01 class 0x060400
[ 1.219384] pci 0000:02:00.0: PME# supported from D0 D1 D2 D3hot D3cold
[ 1.219938] PCI: bus2: Fast back to back transfers disabled
[ 1.219974] pci 0000:02:00.0: bridge configuration invalid ([bus 00-00]), reconfiguring
[ 1.220340] pci 0000:03:00.0: [8086:1533] type 00 class 0x020000
[ 1.220410] pci 0000:03:00.0: reg 10: [mem 0x00000000-0x000fffff]
[ 1.220487] pci 0000:03:00.0: reg 18: [io 0x0000-0x001f]
[ 1.220570] pci 0000:03:00.0: reg 1c: [mem 0x00000000-0x00003fff]
[ 1.220672] pci 0000:03:00.0: reg 30: [mem 0x00000000-0x000fffff pref]
[ 1.220844] pci 0000:03:00.0: PME# supported from D0 D3hot D3cold
[ 1.222597] PCI: bus3: Fast back to back transfers disabled
[ 1.222634] pci_bus 0000:03: busn_res: [bus 03-ff] end is updated to 03
[ 1.222673] pci_bus 0000:02: busn_res: [bus 02-ff] end is updated to 03
[ 1.223177] pcieport 0000:02:00.0: Signaling PME through PCIe PME interrupt
[ 1.223215] pci 0000:03:00.0: Signaling PME through PCIe PME interrupt
[ 1.223253] pcie_pme 0000:02:00.0:pcie01: service driver pcie_pme loaded
[ 1.223539] aer 0000:02:00.0:pcie02: service driver aer loaded
[ 1.223753] pcieport 0000:02:00.0: BAR 8: assigned [mem 0x32100000-0x322fffff]
[ 1.223793] pcieport 0000:02:00.0: BAR 9: assigned [mem 0x12100000-0x121fffff pref]
[ 1.223832] pcieport 0000:02:00.0: BAR 7: assigned [io 0x10000-0x10fff]
[ 1.223871] pci 0000:03:00.0: BAR 0: assigned [mem 0x32100000-0x321fffff]
[ 1.223914] pci 0000:03:00.0: BAR 6: assigned [mem 0x12100000-0x121fffff pref]
[ 1.223952] pci 0000:03:00.0: BAR 3: assigned [mem 0x32200000-0x32203fff]
[ 1.223993] pci 0000:03:00.0: BAR 2: assigned [io 0x10000-0x1001f]
[ 1.224032] pcieport 0000:02:00.0: PCI bridge to [bus 03]
[ 1.224064] pcieport 0000:02:00.0: bridge window [io 0x10000-0x10fff]
[ 1.224101] pcieport 0000:02:00.0: bridge window [mem 0x32100000-0x322fffff]
[ 1.224140] pcieport 0000:02:00.0: bridge window [mem 0x12100000-0x121fffff pref]
[ 1.224202] pcieport 0000:00:00.0: BAR 9: assigned [mem 0x20000000-0x2fffffff 64bit pref]
[ 1.224242] pcieport 0000:00:00.0: BAR 8: assigned [mem 0x32300000-0x323fffff]
[ 1.224282] pci 0000:01:00.0: BAR 0: assigned [mem 0x20000000-0x2fffffff 64bit pref]
[ 1.224346] pci 0000:01:00.0: BAR 2: assigned [mem 0x32300000-0x32307fff]
[ 1.224386] pcieport 0000:00:00.0: PCI bridge to [bus 01]
[ 1.224421] pcieport 0000:00:00.0: bridge window [mem 0x32300000-0x323fffff]
[ 1.224459] pcieport 0000:00:00.0: bridge window [mem 0x20000000-0x2fffffff 64bit pref]
[ 1.249454] pcieport 0000:00:00.0: AER: Uncorrected (Non-Fatal) error received: id=0010
[ 1.249593] pcieport 0000:00:00.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, id=0000(Requester ID)
[ 1.249642] pcieport 0000:00:00.0: device [10de:0e12] error status/mask=00004000/00000000
[ 1.249681] pcieport 0000:00:00.0: [14] Completion Timeout (First)
[ 1.249722] pcieport 0000:00:00.0: broadcast error_detected message
[ 1.249759] pcieport 0000:00:00.0: AER: Device recovery failed
[ 1.249793] pcieport 0000:00:00.0: AER: Multiple Uncorrected (Non-Fatal) error received: id=0010
[ 1.249853] pcieport 0000:00:00.0: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, id=0000(Requester ID)
[ 1.249861] pcieport 0000:00:00.0: device [10de:0e12] error status/mask=00004000/00000000
[ 1.249868] pcieport 0000:00:00.0: [14] Completion Timeout (First)
[ 1.249882] pcieport 0000:00:00.0: broadcast error_detected message
[ 1.249894] pcieport 0000:00:00.0: AER: Device recovery failed
[ 1.250929] pwm-backlight pwm-backlight: unable to request PWM, trying legacy API
[ 1.251128] sysedp_create_consumer: unable to create pwm-backlight, no consumer_data for pwm-backlight found
[ 1.252688] tsec tsec: initialized
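For reference, the "error status/mask=00004000/00000000" value in the AER lines above can be decoded bit by bit. This is only a minimal sketch in Python, assuming the bit layout of the AER Uncorrectable Error Status register from the PCIe specification (the same bit numbers the kernel prints in brackets). The helper name decode_aer_status and the table name are made up for illustration and are not part of any kernel tool.

```python
# Bit positions in the PCIe AER Uncorrectable Error Status register.
# Assumption: layout per the PCIe specification; bits not listed here
# are reported as "Unknown/reserved".
UNCORRECTABLE_BITS = {
    4: "Data Link Protocol Error",
    5: "Surprise Down Error",
    12: "Poisoned TLP",
    13: "Flow Control Protocol Error",
    14: "Completion Timeout",
    15: "Completer Abort",
    16: "Unexpected Completion",
    17: "Receiver Overflow",
    18: "Malformed TLP",
    19: "ECRC Error",
    20: "Unsupported Request",
}


def decode_aer_status(status_hex, mask_hex="00000000"):
    """Return a list of (bit, name, masked) for every bit set in status_hex.

    Hypothetical helper: both arguments are the hex strings exactly as
    they appear in the kernel log line "error status/mask=.../...".
    """
    status = int(status_hex, 16)
    mask = int(mask_hex, 16)
    decoded = []
    for bit in range(32):
        if status & (1 << bit):
            name = UNCORRECTABLE_BITS.get(bit, "Unknown/reserved")
            masked = bool(mask & (1 << bit))
            decoded.append((bit, name, masked))
    return decoded


if __name__ == "__main__":
    # Values copied from the log above.
    for bit, name, masked in decode_aer_status("00004000", "00000000"):
        state = "masked" if masked else "unmasked"
        print(f"bit {bit}: {name} ({state})")
```

For the value in our log this yields only bit 14, Completion Timeout, unmasked, which agrees with the "[14] Completion Timeout (First)" line printed by the kernel.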
What could be the reason?
We are using the latest kernel (version 21.5). Any suggestions would be very helpful.
Thanks in advance.