Dear CUDA people,
my TESLA S1070 connected to a HP ProLiant 785 G5 server with Ubuntu linux 8.04 installed is making trouble.
After reboot, no nvidia device is present in /dev/.
I tried different drivers (177.70.11, 177.70.18, etc). After trying you little script of cuda 1.1 release notes, I have all devices in /dev/, but deviceQuery is very slow (right information after 5 min.). furthermore, I get kernel logs:
Nov 19 17:03:25 giselle kernel: [ 1026.903369] warning: process `nvidia-installe’ used the deprecated sysctl system call with 1.23.
Nov 19 17:04:27 giselle kernel: [ 1088.330866] PCI: Setting latency timer of device 0000:87:00.0 to 64
Nov 19 17:04:27 giselle kernel: [ 1088.331025] PCI: Setting latency timer of device 0000:89:00.0 to 64
Nov 19 17:04:27 giselle kernel: [ 1088.331193] PCI: Setting latency timer of device 0000:c7:00.0 to 64
Nov 19 17:04:27 giselle kernel: [ 1088.331348] PCI: Setting latency timer of device 0000:c9:00.0 to 64
Nov 19 17:04:27 giselle kernel: [ 1088.331502] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 177.70.11 Tue Sep 9 16:26:11 PDT 2008
Nov 19 17:05:23 giselle kernel: [ 1144.096083] Uhhuh. NMI received for unknown reason b1.
Nov 19 17:05:23 giselle kernel: [ 1144.096233] You have some hardware problem, likely on the PCI bus.
Nov 19 17:05:23 giselle kernel: [ 1144.096319] Dazed and confused, but trying to continue
Please see attached nvidia-bug-report.log. I’m not sure whether it is a hardware failure (Host,Bridge,Device??) or OS/Driver problem.
Best regards,
Nico.
nvidia_bug_report.zip (38.5 KB)