Dear NVIDIA Support Team,
I am encountering issues with interrupt distribution on my Jetson AGX Orin when using an Intel 82599ES 10-Gigabit Network Card. All RX interrupts are consistently routed to CPU0, even after attempting various affinity configuration methods.
System Configuration
- Hardware:
- Jetson AGX Orin Dev Kit (JetPack 6.0 rev2 / L4T R36.2.1)
- PCIe Card: Intel 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01)
- PCIe Slot: x16 Gen3
- Software:
- Kernel: 5.15.136-tegra (PREEMPT RT)
- Driver: ixgbe v5.15.136-tegra
- Firmware: 0x8000091d, 1.1825.0
Problem Symptoms
- Interrupt Concentration:
All RX queues (eth1-TxRx-0 to eth1-TxRx-11) are handled by CPU0, as shown in /proc/interrupts:
302: 89874 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612928 Edge eth1-TxRx-0
303: 18512388 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612929 Edge eth1-TxRx-1
304: 2211057 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612930 Edge eth1-TxRx-2
305: 17802941 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612931 Edge eth1-TxRx-3
306: 308635 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612932 Edge eth1-TxRx-4
307: 660103 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612933 Edge eth1-TxRx-5
308: 371502 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612934 Edge eth1-TxRx-6
309: 95708 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612935 Edge eth1-TxRx-7
310: 251272 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612936 Edge eth1-TxRx-8
311: 829679 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612937 Edge eth1-TxRx-9
312: 201855 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612938 Edge eth1-TxRx-10
313: 156559 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612939 Edge eth1-TxRx-11
314: 3 0 0 0 0 0 0 0 0 0 0 0 PCI-MSI 671612940 Edge eth1
- Failed Affinity Configuration:
Attempts to set CPU affinity via /proc/irq or /sys/class/net have failed:
$ echo 1 | sudo tee /proc/irq/302/smp_affinity_list
tee: /proc/irq/302/smp_affinity_list: Invalid argument
$ echo 1 | sudo tee /proc/irq/302/smp_affinity
tee: /proc/irq/302/smp_affinity: Invalid argument
- RPS Configuration Ineffective:
Configuring RPS (Receive Packet Steering) did not redistribute interrupts:
$ cat /sys/class/net/eth1/queues/rx-*/rps_cpus
001
008
016
002
004
008
016
032
064
128
256
512
Despite configuration, interrupts remain on CPU0.
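One thing worth checking in the values above: rps_cpus is read and written as a hexadecimal CPU bitmask, so 016 means 0x16 (CPUs 1, 2 and 4), not CPU 4. The following is a minimal sketch for decoding the masks. `mask_to_cpus` is a made-up helper name, and the sketch assumes a single mask group without commas, which holds for the 12 CPUs here.

```shell
#!/bin/sh
# mask_to_cpus is a hypothetical helper, not part of any kernel or ethtool tooling.
# It converts one hexadecimal CPU bitmask (as printed by rps_cpus) into a
# space-separated list of CPU indices.
# Assumption: the mask has no comma-separated groups (at most 32 CPUs).
# The body runs in a subshell "( ... )" so its variables do not leak.
mask_to_cpus() (
    mask=$((0x$1))      # parse the argument as hexadecimal
    cpu=0
    cpus=""
    while [ "$mask" -gt 0 ]; do
        if [ $((mask & 1)) -eq 1 ]; then
            cpus="${cpus:+$cpus }$cpu"   # append this CPU index
        fi
        mask=$((mask >> 1))
        cpu=$((cpu + 1))
    done
    echo "$cpus"
)

# Example: decode the mask of every RX queue of eth1.
# for f in /sys/class/net/eth1/queues/rx-*/rps_cpus; do
#     echo "$f: CPUs $(mask_to_cpus "$(cat "$f")")"
# done
```

For example, `mask_to_cpus 512` prints `1 4 8 10`. As far as I understand from the kernel's network scaling documentation, RPS redistributes packet processing in software after the hardware interrupt has already been taken, so the counters in /proc/interrupts would not be expected to move even when RPS is working.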
Troubleshooting Steps
- PCIe Device Verification:
$ lspci -s 0005:01:00.0
0005:01:00.0 Ethernet controller: Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection (rev 01)
- Multi-Queue Support:
$ ethtool -l eth1
Channel parameters for eth1:
Pre-set maximums:
RX: n/a
TX: n/a
Other: 1
Combined: 12
Current hardware settings:
RX: n/a
TX: n/a
Other: 1
Combined: 12
- Interrupt Affinity Settings:
All IRQs show affinity mask 0-11 (all CPUs), but interrupts still go to CPU0:
$ cat /proc/irq/{302..314}/smp_affinity_list
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
0-11
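smp_affinity_list only reports the set of CPUs an IRQ is allowed to target. Kernels built with CONFIG_GENERIC_IRQ_EFFECTIVE_AFF_MASK also expose /proc/irq/<N>/effective_affinity_list, which reports the CPU the interrupt is actually routed to. I have not confirmed that this file is present on the 5.15.136-tegra kernel, so the sketch below prints n/a when it is missing. `show_irq_affinity` and the `IRQ_BASE` override are hypothetical names introduced only for this example.

```shell
#!/bin/sh
# show_irq_affinity is a hypothetical helper. For each IRQ number given, it
# prints the requested affinity (smp_affinity_list) next to the affinity the
# kernel reports as in effect (effective_affinity_list), or n/a if a file
# cannot be read.
# IRQ_BASE defaults to /proc/irq; the override exists only to allow testing.
show_irq_affinity() (
    base="${IRQ_BASE:-/proc/irq}"
    for irq in "$@"; do
        requested=$(cat "$base/$irq/smp_affinity_list" 2>/dev/null || echo "n/a")
        effective=$(cat "$base/$irq/effective_affinity_list" 2>/dev/null || echo "n/a")
        echo "IRQ $irq: requested=$requested effective=$effective"
    done
)

# Example for the eth1 vectors listed above:
# show_irq_affinity $(seq 302 314)
```

If the effective value reads 0 for every vector while the requested value is 0-11, that would at least confirm that the kernel is selecting CPU0 from the allowed set, rather than the mask being ignored.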
Questions
- Why are interrupts locked to CPU0 even when smp_affinity_list is set to 0-11 and RPS is configured to distribute to other CPUs?
- How can I force interrupts to be distributed across CPU1-CPU11?
- Are there Tegra-specific kernel parameters or driver flags for this?
- Is there a hardware/software compatibility issue between the PCIe network card and the Jetson AGX Orin?
Thank you for your assistance!