Cannot Recognize 4th Tesla

sluke · July 29, 2009, 3:39pm

I just received 4 C1060s through the developer program and have been working to install them in an nForce 780a motherboard (the Foxconn Destroyer that used to be recommended). The machine is currently running Ubuntu 8.04. I have downloaded and installed the 185.18.14 version of the driver and everything seemed to go smoothly. However, whenever I query the devices with deviceQuery, I find the first one listed is a C1060, the second is the onboard nForce 780a SLI, the third is a C1060 and the fourth is a C1060. Running the typical tests confirms that devices 0, 2 and 3 perform like Teslas and device 1 is quite slow.

How do I go about getting the 4th Tesla recognized? I thought about disabling the onboard graphics (I don’t mind if the computer is headless). However, the BIOS will not let me do this completely (it will only disable them if an external graphics card is detected), and additionally I cannot get CUDA to work without starting the X server. I will be happy to post any additional information that is helpful (such as the output of lspci), but I didn’t want to prematurely clutter the message board with unnecessary information if this has a trivial solution.

Thanks,
Luke

sluke · July 30, 2009, 2:26am

It turns out my problem was a loose connection: one of the cards was not fully seated in the PCIe slot. Now that all of the devices are recognized, I have one more question. I would like to reorder the devices so that the first four devices are the C1060s and the integrated graphics comes last. At the moment the integrated graphics is device 1. Can this easily be changed?

Luke

mfatica · July 30, 2009, 2:26am

You can’t reorder the devices, but with nvidia-smi you can exclude the onboard graphics from running CUDA code.

sluke · July 30, 2009, 4:58am

This sounds perfect, but I cannot figure out how to get it to work. I assume you mean to use nvidia-smi -t? If this is the case I cannot figure out the ID of the onboard card.

Running nvidia-smi without any arguments prints only

==============NVSMI LOG==============

Timestamp : Thu Jul 30 00:55:14 2009

and running nvidia-smi -L prints

GPU #0: (084C10DE:0D0D105B) nForce 780a SLI
GPU #1: (05E710DE:066A10DE) Tesla C1060
GPU #2: (05E710DE:066A10DE) Tesla C1060
GPU #3: (05E710DE:066A10DE) Tesla C1060
GPU #4: (05E710DE:066A10DE) Tesla C1060

I have tried nvidia-smi -t 0, to which I get the message: Unit number out of range! Likewise, using either of the numbers in parentheses pops up the help message.

mfatica · July 30, 2009, 6:32am

This will exclude GPU 0 (compute mode 2 is compute-prohibited: no compute programs may run on this GPU):

nvidia-smi -g 0 -c 2

If X is not running, you may need to run nvidia-smi in loop mode.
After you have issued the command, check with:

nvidia-smi -g 0 -s
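
The index passed to -g above is the GPU number from the nvidia-smi -L listing earlier in the thread. As a rough sketch only, the following Python picks that index out of the listing automatically. It assumes the listing format shown in this thread (GPU #N: (ids) name) and treats any GPU whose name does not contain "Tesla" as the one to exclude. The function names and the SAMPLE text are made up for illustration; they are not part of nvidia-smi.

```python
import re

# Matches listing lines in the format shown in this thread, e.g.
#   GPU #0: (084C10DE:0D0D105B) nForce 780a SLI
# Assumption: other driver versions may print a different format.
LINE_RE = re.compile(r"^GPU #(\d+):\s+\([0-9A-Fa-f]+:[0-9A-Fa-f]+\)\s+(.+?)\s*$")


def parse_gpu_listing(text):
    """Return a list of (index, name) tuples parsed from the listing text."""
    gpus = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            gpus.append((int(match.group(1)), match.group(2)))
    return gpus


def non_tesla_indices(gpus):
    """Return indices of GPUs whose name does not mention 'Tesla'."""
    return [index for index, name in gpus if "Tesla" not in name]


# Listing copied from the post above; in practice this text would come
# from running nvidia-smi -L.
SAMPLE = """\
GPU #0: (084C10DE:0D0D105B) nForce 780a SLI
GPU #1: (05E710DE:066A10DE) Tesla C1060
GPU #2: (05E710DE:066A10DE) Tesla C1060
GPU #3: (05E710DE:066A10DE) Tesla C1060
GPU #4: (05E710DE:066A10DE) Tesla C1060
"""

if __name__ == "__main__":
    for index in non_tesla_indices(parse_gpu_listing(SAMPLE)):
        # Print the command from this thread rather than running it.
        print(f"nvidia-smi -g {index} -c 2")
```

With the listing from this thread, the script prints nvidia-smi -g 0 -c 2, which matches the command given above. It only prints the command, so nothing changes until you run it yourself.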

JauntyJack · September 23, 2009, 8:03pm

I ran into that with the bottom Tesla of 4 on my Destroyer Mobo – the cables coming from the USB and 1394a sockets on the bottom of the board press against the mobo side of the bottom Tesla and act as springs trying to force it out of the PCIe slot. Tightening the cable clamps helped with the USB wires, but I ended up disconnecting the 1394a plug completely to solve this issue.
