GTX 560 Ti - randomly crashes/hang with official driver

Card model: MSI N560GTX-Ti Hawk
CPU: AMD FX-8350
Motherboard: GIGATYBE 990FXA-D3
RAM: Kingston DDR3 1600 8GB

OS: Fedora 17 64bit
kernel: upstream by yum
driver version: form 310 series to 313 series(using 313.18 now)

Pictures when a problem occurred:

This problem actually occurred on my dual-boot Win7 64bit too. But the problem on my Windows was not so critical, because Windows will just popup a error message, reloading the driver and everything back to work.

On Linux/Fedora - it cause system not responding, just have to reboot the PC. Once Fedora is my primary work environment, so I want to deal with this problem on Fedora first.

I can’t figure out how the problem come from, I can confirm that the problem only occurred when the web browser (Chrome or Firefox) is running, and I never have this problem when playing 3D games(CS:Source, Killing Floor, Serious Sam: BFE and Unigine benchmark).

Addition information:
http://darkranger.no-ip.org/uploads/images/nvidia/nvidia-bug-report.log.gz
http://darkranger.no-ip.org/uploads/images/nvidia/nvidia-settings.png

This problem actually occurred on my dual-boot Win7 64bit too

It seems to be hardware problem.

Did you check your hardware?
Did you run some diagnostic tool, such as memtest86+?

OK, I ran memtest86+ today, and the result seems fine.

Also I never got crash when doing some heavier task like video editing/encoding, software compiling and 3D games. So it hard to find out what’s the problem on my hardware.

Obviously Faulty Video RAM, RMA the card.

I have followed your advice to do the RMA, and they have given me a new one.

Hope it will solve the problem, thanks.

Did it solve the issue?

Hello,

I have similar problem symptoms with my 560Ti but the freeze only happen when my computer is idle for several hours with or without screensaver. If I keep using the computer I never experience the problem.

I run ubuntu 13.04 but had the problem with 12.04 and 12.10 with various drivers, the latest from 319.23 or the 310 or 304 packed with ubuntu.

I don’t really beleive this is a hardware problem as using the opensource “nouveau” driver my system has ran more than 45 days in a row without problem. But I need the nvidia driver as I use darktable and need the opencl libs.

In my case sometimes hardreboot is the only way but sometimes I can still ssh my machine from another computer and soft reboot.

I found that in the syslog I have always the same paterns of message, see bellow.

Does somebody have any clue ?

==============
Jun 8 00:50:09 sesame kernel: [ 9490.342715] NVRM: GPU at 0000:01:00: GPU-b461abdf-e006-78b8-aa18-a93b836d738e
Jun 8 00:50:09 sesame kernel: [ 9490.342718] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:50:09 sesame kernel: [ 9490.361512] NVRM: Xid (0000:01:00): 31, Ch 00000001, engmask 00000101, intr 10000000
Jun 8 00:50:18 sesame kernel: [ 9499.321065] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:50:34 sesame kernel: [ 9515.279452] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:50:58 sesame kernel: [ 9539.215055] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:50:58 sesame kernel: [ 9539.224466] NVRM: Xid (0000:01:00): 13, 0001 00000000 0000902d 00000220 00007f8c 0000000c
Jun 8 00:51:14 sesame kernel: [ 9555.173448] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:51:30 sesame kernel: [ 9571.131843] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 00:51:46 sesame kernel: [ 9587.090240] NVRM: Xid (0000:01:00): 8, Channel 00000001

and later…
Jun 8 01:34:21 sesame kernel: [12134.833537] NVRM: Xid (0000:01:00): 8, Channel 00000001
Jun 8 01:34:29 sesame kernel: [12137.140462] NVRM: Xid (0000:01:00): 38, 0001 00004000 00000000 00000000 00000000 00000000
Jun 8 01:34:29 sesame kernel: [12139.638378] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Jun 8 01:34:29 sesame kernel: [12141.632703] NVRM: os_schedule: Attempted to yield the CPU while in atomic or interrupt context
Jun 8 01:34:29 sesame kernel: [12141.968657] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
Jun 8 01:34:29 sesame kernel: [12142.208539] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
Jun 8 01:34:29 sesame kernel: [12142.208551] [sched_delayed] sched: RT throttling activated
Jun 8 01:34:31 sesame kernel: [12144.339128] irq 16: nobody cared (try booting with the “irqpoll” option)
Jun 8 01:34:31 sesame kernel: [12144.339134] Pid: 0, comm: swapper/2 Tainted: PF W O 3.8.0-23-generic #34-Ubuntu
Jun 8 01:34:31 sesame kernel: [12144.339136] Call Trace:
Jun 8 01:34:31 sesame kernel: [12144.339138] [] __report_bad_irq+0x3d/0xe0
Jun 8 01:34:31 sesame kernel: [12144.339152] [] note_interrupt+0x1c2/0x210
Jun 8 01:34:31 sesame kernel: [12144.339158] [] ? cpuidle_wrap_enter+0x58/0xa0
Jun 8 01:34:31 sesame kernel: [12144.339161] [] ? centrino_target+0x370/0x370
Jun 8 01:34:31 sesame kernel: [12144.339165] [] handle_irq_event_percpu+0xa7/0x1f0
Jun 8 01:34:31 sesame kernel: [12144.339169] [] ? centrino_target+0x370/0x370
Jun 8 01:34:31 sesame kernel: [12144.339172] [] handle_irq_event+0x4e/0x80
Jun 8 01:34:31 sesame kernel: [12144.339177] [] handle_fasteoi_irq+0x5a/0x100
Jun 8 01:34:31 sesame kernel: [12144.339183] [] handle_irq+0x1e/0x30
Jun 8 01:34:31 sesame kernel: [12144.339188] [] do_IRQ+0x5a/0xe0
Jun 8 01:34:31 sesame kernel: [12144.339192] [] common_interrupt+0x6d/0x6d
Jun 8 01:34:31 sesame kernel: [12144.339193] [] ? cpuidle_wrap_enter+0x58/0xa0
Jun 8 01:34:31 sesame kernel: [12144.339200] [] cpuidle_enter_tk+0x10/0x20
Jun 8 01:34:31 sesame kernel: [12144.339204] [] cpuidle_idle_call+0xa5/0x260
Jun 8 01:34:31 sesame kernel: [12144.339209] [] cpu_idle+0xaf/0x120
Jun 8 01:34:31 sesame kernel: [12144.339212] [] start_secondary+0x1e0/0x1e5
Jun 8 01:34:31 sesame kernel: [12144.339214] handlers:
Jun 8 01:34:31 sesame kernel: [12144.339219] [] usb_hcd_irq
Jun 8 01:34:31 sesame kernel: [12144.339301] [] nv_kern_isr [nvidia]