I have the new P6N Diamond mainboard from MSI, nVidia nForce 680i SLI chipset.
I have installed 3 cards. If I do independent parallel computations (no communication over the CPU) with only two cards (any combination works), each card completes 50 work units per second, which is fine. If I run computations on three cards, I get only 105 work units in total! The performance of two of the cards drops to, say, 25 and 30 work units, while the third card gives full performance. This behavior is not static, meaning the performance drop ‘switches’ from card to card. If I kill the computation on any one of the three cards, everything is fine again (50+50). Does anybody have this mainboard with three cards? Has anybody installed three cards on a board and experienced similar things?
I took a look at your board on newegg and noticed this:
Expansion Slots
PCI Express x16 × 4 (the 4 PCI Express interfaces will operate in either x8+x8+x16+x8 or x16+x16+x8 mode)
Which slots do you have the affected cards in? Is there a BIOS option to modify the PCI-E lane configuration? It could be that it defaulted to the first option listed and your first two cards are only running in x8 mode. Whether that has a noticeable impact in CUDA probably depends greatly on your program, but the 8800 GTX and Ultra are practically the only cards that show a noticeable performance change going from x8 to x16 PCI-E bandwidth in 3D games. I wouldn’t be completely shocked to see a performance drop in some CUDA apps if this is in fact the case. :)
That could be; I will have to contact NVIDIA and MSI… But I only do computations on the graphics card, with just some ASCII text printed out, so there is almost no data transfer over PCI-E. Could there nevertheless be an automatic lowering of compute performance when going down to x8 bandwidth?
Because CPU threads busy-wait in a spin loop (with a thread yield in it) when synchronizing the GPU and CPU. If you have one CPU core busy-waiting on 2 GPUs, significant delays will be introduced.