Multiple users faced have faced the same issue in this forum. No fruitful resolution from NVIDIA yet.
Is this issue specific to NVIDIA driver?
Operation :
Screen freeze during display. Xorg and nv_queue processes had 100% CPU utilization during the time :
5283 root 20 0 190m 53m 38m R 100.0 0.3 44:01.43 Xorg
5360 root 20 0 0 0 0 R 100.0 0.0 28:12.56 nv_queue
OS version:
Linux version 4.1.12-124.24.3.el6uek.x86_64
Red Hat 4.9.2-6.2.0.3
gcc version 4.9.2
GPU used:
NVIDIA GPU Quadro P2000 (GP106GL-A) at PCI:4:0:0 (GPU-0)
Memory: 5242880 kBytes
VideoBIOS: 86.06.74.00.01
NVIDIA dlloader X Driver 430.26
Error logs :
2019-10-19T00:16:40.552138-05:00 magic kernel: [ 3332.730693] ------------[ cut here ]------------
2019-10-19T00:16:40.552149-05:00 magic kernel: [ 3332.730700] WARNING: CPU: 0 PID: 5283 at net/sched/sch_generic.c:306 dev_watchdog+0x246/0x250()
2019-10-19T00:16:40.552150-05:00 magic kernel: [ 3332.730701] NETDEV WATCHDOG: eth0 (igb): transmit queue 4 timed out
2019-10-19T00:16:40.552158-05:00 magic kernel: [ 3332.730702] Modules linked in: xt_nat veth nvidia_uvm(OE) nfnetlink_queue nfnetlink_log nfnetlink ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay tun ip6table_filter ip6_tables iptable_filter ip_tables ipv6 rcim2usb(OE) psysdrv(POE) esdcan_pci200(POE) cbc dm_crypt iTCO_wdt iTCO_vendor_support hp_wmi sparse_keymap rfkill serio_raw pcspkr sb_edac edac_core lpc_ich mfd_core i2c_i801 snd_hda_codec_hdmi e1000e sg xhci_pci xhci_hcd snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_controller snd_hda_codec snd_hda_core snd_seq snd_seq_device snd_pcm snd_timer snd_hwdep snd soundcore nvidia_drm(POE) drm nvidia_modeset(POE) nvidia(POE) ipmi_msghandler igb dca i2c_algo_bit i2c_core ptp pps_core wmi ext4 jbd2 mbcache2 sr_mod cdrom sd_mod ahci libahci dm_mirror dm_region_hash dm_log dm_mod
2019-10-19T00:16:40.552170-05:00 magic kernel: [ 3332.730740] CPU: 0 PID: 5283 Comm: Xorg Tainted: P OE Z 4.1.12-124.24.3.el6uek.x86_64 #2
2019-10-19T00:16:40.552172-05:00 magic kernel: [ 3332.730741] Hardware name: Hewlett-Packard HP Z440 Workstation/212B, BIOS M60 v02.38 11/08/2017
2019-10-19T00:16:40.552173-05:00 magic kernel: [ 3332.730742] 0000000000000000 ffff88041c603d58 ffffffff816e9810 ffff88041c603da8
2019-10-19T00:16:40.552174-05:00 magic kernel: [ 3332.730744] ffffffff81a647a0 ffff88041c603d98 ffffffff8108601a ffff88041c603d80
2019-10-19T00:16:40.552175-05:00 magic kernel: [ 3332.730746] 0000000000000004 ffff880417d33b00 0000000000000000 ffff880413d34000
2019-10-19T00:16:40.552176-05:00 magic kernel: [ 3332.730748] Call Trace:
2019-10-19T00:16:40.552179-05:00 magic kernel: [ 3332.730749] [] dump_stack+0x63/0x81
2019-10-19T00:16:40.552180-05:00 magic kernel: [ 3332.730757] [] warn_slowpath_common+0x8a/0xc0
2019-10-19T00:16:40.552182-05:00 magic kernel: [ 3332.730759] [] warn_slowpath_fmt+0x46/0x50
2019-10-19T00:16:40.552183-05:00 magic kernel: [ 3332.730762] [] dev_watchdog+0x246/0x250
2019-10-19T00:16:40.552184-05:00 magic kernel: [ 3332.730764] [] ? dev_graft_qdisc+0x80/0x80
2019-10-19T00:16:40.552185-05:00 magic kernel: [ 3332.730767] [] call_timer_fn+0x40/0x160
2019-10-19T00:16:40.552187-05:00 magic kernel: [ 3332.730769] [] ? dev_graft_qdisc+0x80/0x80
2019-10-19T00:16:40.552188-05:00 magic kernel: [ 3332.730770] [] run_timer_softirq+0x278/0x350
2019-10-19T00:16:40.552189-05:00 magic kernel: [ 3332.730772] [] __do_softirq+0x120/0x300
2019-10-19T00:16:40.552191-05:00 magic kernel: [ 3332.730774] [] irq_exit+0x125/0x130
2019-10-19T00:16:40.552192-05:00 magic kernel: [ 3332.730777] [] smp_apic_timer_interrupt+0x54/0x70
2019-10-19T00:16:40.552193-05:00 magic kernel: [ 3332.730779] [] apic_timer_interrupt+0x196/0x1a0
2019-10-19T00:16:40.552194-05:00 magic kernel: [ 3332.730780]
2019-10-19T00:16:40.552195-05:00 magic kernel: [ 3332.730781] —[ end trace 768750f06e092ab4 ]—
2019-10-19T00:16:40.552196-05:00 magic kernel: [ 3332.730791] igb 0000:01:00.3 eth0: Reset adapter
2019-10-19T00:16:41.210112-05:00 magic kernel: [ 3333.388692] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:42.540110-05:00 magic kernel: [ 3334.718693] igb 0000:01:00.3 eth0: igb: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: RX
2019-10-19T00:16:43.210110-05:00 magic kernel: [ 3335.388684] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:45.210122-05:00 magic kernel: [ 3337.388676] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:47.210140-05:00 magic kernel: [ 3339.388665] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:49.210111-05:00 magic kernel: [ 3341.388658] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:51.210109-05:00 magic kernel: [ 3343.388646] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:53.210128-05:00 magic kernel: [ 3345.388637] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:54.458752-05:00 magic dhcpd: DHCPREQUEST for 192.168.2.45 from 14:78:0b:00:0b:35 (31825717E2451) via eth1
2019-10-19T00:16:54.458777-05:00 magic dhcpd: DHCPACK on 192.168.2.45 to 14:78:0b:00:0b:35 (31825717E2451) via eth1
2019-10-19T00:16:55.210141-05:00 magic kernel: [ 3347.388628] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:55.928770-05:00 magic kernel: [ 3348.103624] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
2019-10-19T00:16:55.928785-05:00 magic kernel: [ 3348.103629] Modules linked in: xt_nat veth nvidia_uvm(OE) nfnetlink_queue nfnetlink_log nfnetlink ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay tun ip6table_filter ip6_tables iptable_filter ip_tables ipv6 rcim2usb(OE) psysdrv(POE) esdcan_pci200(POE) cbc dm_crypt iTCO_wdt iTCO_vendor_support hp_wmi sparse_keymap rfkill serio_raw pcspkr sb_edac edac_core lpc_ich mfd_core i2c_i801 snd_hda_codec_hdmi e1000e sg xhci_pci xhci_hcd snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_controller snd_hda_codec snd_hda_core snd_seq snd_seq_device snd_pcm snd_timer snd_hwdep snd soundcore nvidia_drm(POE) drm nvidia_modeset(POE) nvidia(POE) ipmi_msghandler igb dca i2c_algo_bit i2c_core ptp pps_core wmi ext4 jbd2 mbcache2 sr_mod cdrom sd_mod ahci libahci dm_mirror dm_region_hash dm_log dm_mod
2019-10-19T00:16:55.928787-05:00 magic kernel: [ 3348.103695] CPU: 2 PID: 0 Comm: swapper/2 Tainted: P W OE Z 4.1.12-124.24.3.el6uek.x86_64 #2
2019-10-19T00:16:55.928788-05:00 magic kernel: [ 3348.103697] Hardware name: Hewlett-Packard HP Z440 Workstation/212B, BIOS M60 v02.38 11/08/2017
2019-10-19T00:16:55.928791-05:00 magic kernel: [ 3348.103700] task: ffff88041a0cb800 ti: ffff88041a0e0000 task.ti: ffff88041a0e0000
2019-10-19T00:16:55.928799-05:00 magic kernel: [ 3348.103702] RIP: 0010:[] [] _nv029925rm+0x12/0x40 [nvidia]
2019-10-19T00:16:55.928800-05:00 magic kernel: [ 3348.103876] RSP: 0018:ffff88041c683b88 EFLAGS: 00000246
2019-10-19T00:16:55.928801-05:00 magic kernel: [ 3348.103878] RAX: 00000000136000a1 RBX: 0000000000000000 RCX: 0000000000000000
2019-10-19T00:16:55.928802-05:00 magic kernel: [ 3348.103881] RDX: ffff880416c25290 RSI: ffff880416c24008 RDI: ffff880417d1d008
2019-10-19T00:16:55.928803-05:00 magic kernel: [ 3348.103883] RBP: ffff880417412dc0 R08: 0000000000000020 R09: ffff880417412dd8
2019-10-19T00:16:55.928804-05:00 magic kernel: [ 3348.103885] R10: ffffffffa05d09b0 R11: ffffffffffffffff R12: ffff88041c683af8
2019-10-19T00:16:55.928805-05:00 magic kernel: [ 3348.103887] R13: ffffffff816f36b6 R14: ffff880417412dc0 R15: ffff880416c25290
2019-10-19T00:16:55.928807-05:00 magic kernel: [ 3348.103890] FS: 0000000000000000(0000) GS:ffff88041c680000(0000) knlGS:0000000000000000
2019-10-19T00:16:55.928808-05:00 magic kernel: [ 3348.103892] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2019-10-19T00:16:55.928809-05:00 magic kernel: [ 3348.103894] CR2: ffffffffff600400 CR3: 0000000001adc000 CR4: 0000000000160670
2019-10-19T00:16:55.928809-05:00 magic kernel: [ 3348.103896] Stack:
2019-10-19T00:16:55.928811-05:00 magic kernel: [ 3348.103898] ffffffffa08fe9f0 ffffffffa05d0a99 0000000000000000 ffff8800b6275808
2019-10-19T00:16:55.928812-05:00 magic kernel: [ 3348.103902] ffff880417412e58 0000000000000002 ffff880416c24008 ffffffffa06b6cd0
2019-10-19T00:16:55.928813-05:00 magic kernel: [ 3348.103905] ffff880416c24008 ffff8800b6275808 0000000000000001 0000000000000002
2019-10-19T00:16:55.928813-05:00 magic kernel: [ 3348.103909] Call Trace:
2019-10-19T00:16:55.928814-05:00 magic kernel: [ 3348.103911]
2019-10-19T00:16:55.928815-05:00 magic kernel: [ 3348.104063] [] ? _nv029924rm+0x40/0x40 [nvidia]
2019-10-19T00:16:55.928816-05:00 magic kernel: [ 3348.104311] [] ? _nv020584rm+0xe9/0x160 [nvidia]
2019-10-19T00:16:55.928817-05:00 magic kernel: [ 3348.104586] [] ? _nv026372rm+0x50/0x4f0 [nvidia]
2019-10-19T00:16:55.928818-05:00 magic kernel: [ 3348.104849] [] ? _nv026052rm+0x93/0x100 [nvidia]
2019-10-19T00:16:55.928819-05:00 magic kernel: [ 3348.105113] [] ? _nv007734rm+0x115/0x180 [nvidia]
2019-10-19T00:16:55.928820-05:00 magic kernel: [ 3348.105376] [] ? _nv031029rm+0x28/0x70 [nvidia]
2019-10-19T00:16:55.928821-05:00 magic kernel: [ 3348.105637] [] ? _nv031135rm+0x238/0x270 [nvidia]
2019-10-19T00:16:55.928822-05:00 magic kernel: [ 3348.105882] [] ? _nv020432rm+0xd8/0x190 [nvidia]
2019-10-19T00:16:55.928823-05:00 magic kernel: [ 3348.106128] [] ? _nv020584rm+0xc5/0x160 [nvidia]
2019-10-19T00:16:55.928824-05:00 magic kernel: [ 3348.106392] [] ? _nv020584rm+0x9e/0x160 [nvidia]
2019-10-19T00:16:55.928825-05:00 magic kernel: [ 3348.106649] [] ? _nv022212rm+0x23/0x70 [nvidia]
2019-10-19T00:16:55.928826-05:00 magic kernel: [ 3348.106907] [] ? _nv022222rm+0x7d/0x120 [nvidia]
2019-10-19T00:16:55.928827-05:00 magic kernel: [ 3348.107158] [] ? _nv032963rm+0xb8/0x140 [nvidia]
2019-10-19T00:16:55.928828-05:00 magic kernel: [ 3348.107409] [] ? _nv032965rm+0x522/0x6a0 [nvidia]
2019-10-19T00:16:55.928829-05:00 magic kernel: [ 3348.107657] [] ? _nv032964rm+0x57/0x3a0 [nvidia]
2019-10-19T00:16:55.928830-05:00 magic kernel: [ 3348.107757] [] ? _nv030158rm+0x111/0x1e0 [nvidia]
2019-10-19T00:16:55.928831-05:00 magic kernel: [ 3348.107824] [] ? nv_pci_register_driver+0x20/0x20 [nvidia]
2019-10-19T00:16:55.928832-05:00 magic kernel: [ 3348.107963] [] ? rm_run_rc_callback+0x8b/0xe0 [nvidia]
2019-10-19T00:16:55.928834-05:00 magic kernel: [ 3348.108031] [] ? nvidia_rc_timer_callback+0x45/0x70 [nvidia]
2019-10-19T00:16:55.928835-05:00 magic kernel: [ 3348.108098] [] ? nv_timer_callback_anon_data+0xd/0x10 [nvidia]
2019-10-19T00:16:55.928836-05:00 magic kernel: [ 3348.108104] [] ? call_timer_fn+0x40/0x160
2019-10-19T00:16:55.928850-05:00 magic kernel: [ 3348.108170] [] ? nv_pci_register_driver+0x20/0x20 [nvidia]
2019-10-19T00:16:55.928851-05:00 magic kernel: [ 3348.108174] [] ? run_timer_softirq+0x278/0x350
2019-10-19T00:16:55.928852-05:00 magic kernel: [ 3348.108178] [] ? __do_softirq+0x120/0x300
2019-10-19T00:16:55.928853-05:00 magic kernel: [ 3348.108181] [] ? irq_exit+0x125/0x130
2019-10-19T00:16:55.928854-05:00 magic kernel: [ 3348.108186] [] ? smp_apic_timer_interrupt+0x54/0x70
2019-10-19T00:16:55.928855-05:00 magic kernel: [ 3348.108189] [] ? apic_timer_interrupt+0x196/0x1a0
2019-10-19T00:16:55.928856-05:00 magic kernel: [ 3348.108191]
2019-10-19T00:16:55.928857-05:00 magic kernel: [ 3348.108196] [] ? cpuidle_enter_state+0xdb/0x250
2019-10-19T00:16:55.928858-05:00 magic kernel: [ 3348.108199] [] ? cpuidle_enter_state+0xaa/0x250
2019-10-19T00:16:55.928859-05:00 magic kernel: [ 3348.108202] [] ? cpuidle_enter+0x17/0x20
2019-10-19T00:16:55.928860-05:00 magic kernel: [ 3348.108206] [] ? cpu_startup_entry+0x2ae/0x330
2019-10-19T00:16:55.928861-05:00 magic kernel: [ 3348.108211] [] ? start_secondary+0x1ad/0x1d0
2019-10-19T00:16:55.928863-05:00 magic kernel: [ 3348.108213] Code: c7 e8 23 e6 91 ff 31 c0 48 83 c4 08 c3 66 90 66 2e 0f 1f 84 00 00 00 00 00 48 83 ec 08 39 4a 10 76 17 48 8b 02 c1 e9 02 8b 04 88 <48> 83 c4 08 c3 66 0f 1f 84 00 00 00 00 00 e8 eb 31 00 00 48 89
2019-10-19T00:16:57.210108-05:00 magic kernel: [ 3349.388618] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:59.210121-05:00 magic kernel: [ 3351.388609] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:16:59.211115-05:00 magic kernel: [ 3351.390497] NVRM: Xid (PCI:0000:04:00): 8, Channel 00000001
2019-10-19T00:17:01.214119-05:00 magic kernel: [ 3353.392600] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:03.214108-05:00 magic kernel: [ 3355.392593] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:05.214116-05:00 magic kernel: [ 3357.392584] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:07.214113-05:00 magic kernel: [ 3359.392571] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:09.214126-05:00 magic kernel: [ 3361.392563] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:09.723546-05:00 magic dhcpd: DHCPREQUEST for 192.168.2.45 from 14:78:0b:00:0b:35 (31825717E2451) via eth1
2019-10-19T00:17:09.723572-05:00 magic dhcpd: DHCPACK on 192.168.2.45 to 14:78:0b:00:0b:35 (31825717E2451) via eth1
2019-10-19T00:17:11.214123-05:00 magic kernel: [ 3363.392552] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:13.214121-05:00 magic kernel: [ 3365.392544] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:15.214108-05:00 magic kernel: [ 3367.392533] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:17.214126-05:00 magic kernel: [ 3369.392528] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:18.927633-05:00 magic sshd[160880]: FIPS integrity verification test failed.
2019-10-19T00:17:19.214128-05:00 magic kernel: [ 3371.392515] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:21.214115-05:00 magic kernel: [ 3373.392507] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:22.204841-05:00 magic sshd[160923]: FIPS integrity verification test failed.
2019-10-19T00:17:23.214130-05:00 magic kernel: [ 3375.392497] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
2019-10-19T00:17:23.932144-05:00 magic kernel: [ 3376.103494] NMI watchdog: BUG: soft lockup - CPU#2 stuck for 23s! [swapper/2:0]
2019-10-19T00:17:23.932158-05:00 magic kernel: [ 3376.103498] Modules linked in: xt_nat veth nvidia_uvm(OE) nfnetlink_queue nfnetlink_log nfnetlink ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay tun ip6table_filter ip6_tables iptable_filter ip_tables ipv6 rcim2usb(OE) psysdrv(POE) esdcan_pci200(POE) cbc dm_crypt iTCO_wdt iTCO_vendor_support hp_wmi sparse_keymap rfkill serio_raw pcspkr sb_edac edac_core lpc_ich mfd_core i2c_i801 snd_hda_codec_hdmi e1000e sg xhci_pci xhci_hcd snd_hda_codec_realtek snd_hda_codec_generic snd_hda_intel snd_hda_controller snd_hda_codec snd_hda_core snd_seq snd_seq_device snd_pcm snd_timer snd_hwdep snd soundcore nvidia_drm(POE) drm nvidia_modeset(POE) nvidia(POE) ipmi_msghandler igb dca i2c_algo_bit i2c_core ptp pps_core wmi ext4 jbd2 mbcache2 sr_mod cdrom sd_mod ahci libahci dm_mirror dm_region_hash dm_log dm_mod