Single dual-GPU card vs. 2x single-GPU cards

Hello Colleagues,

As a newcomer to CUDA Development, I need your advice on upgrading a system.

In our application, we have two fast sCMOS cameras. After some independent computation (demosaicing, spatial filtering, and more), the images from the two cameras are overlaid using OpenGL. All computations are in single precision. The whole thing is supposed to run in real time at a high frame rate, which is why we’re upgrading our system.

Clearly, since the computations on the images from the two cameras are independent (but slightly different), it makes sense to do them in parallel on two GPUs. Since we don’t quite have the budget for the Tesla cards, we’re considering buying two GTX 780 Ti cards or a single GTX Titan Z (dual-GPU). Since the Titan Z is the more expensive option, I would like to hear your opinions on any advantages it may confer versus the two-card solution in our application (other than using less power). One important factor is clearly the speed of peer-to-peer copy on the Titan Z, but I couldn’t find the relevant information.
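To make this concrete, here is a minimal sketch of the split we have in mind - one GPU and one stream per camera. The kernels are just placeholders for our actual demosaicing/filtering code:

```
#include <cuda_runtime.h>

// Placeholder kernels standing in for each camera's real pipeline
// (demosaicing, spatial filtering, ...). All single precision.
__global__ void pipelineCam0(float* img, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) img[i] *= 1.0f;   // stand-in for the real per-pixel work
}

__global__ void pipelineCam1(float* img, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) img[i] *= 1.0f;
}

int main()
{
    const int n = 2048 * 2048;   // one frame's worth of pixels
    const int block = 256, grid = (n + block - 1) / block;
    float *d_img0, *d_img1;
    cudaStream_t s0, s1;

    cudaSetDevice(0);            // camera 0's frame lives on GPU 0
    cudaMalloc(&d_img0, n * sizeof(float));
    cudaStreamCreate(&s0);
    pipelineCam0<<<grid, block, 0, s0>>>(d_img0, n);

    cudaSetDevice(1);            // camera 1's frame lives on GPU 1
    cudaMalloc(&d_img1, n * sizeof(float));
    cudaStreamCreate(&s1);
    pipelineCam1<<<grid, block, 0, s1>>>(d_img1, n);

    // Launches are asynchronous, so the two GPUs work concurrently;
    // synchronize both before handing the frames to the OpenGL overlay.
    cudaSetDevice(0); cudaStreamSynchronize(s0);
    cudaSetDevice(1); cudaStreamSynchronize(s1);
    return 0;
}
```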

As a side question, why is the Tesla K10 listed as “server only”?

Thank you in advance!

There are a few pros and cons to consider:

  1. The Titan Z I believe has slightly higher per-GPU performance, but it is much more expensive. I believe it is also intended to be a lot more reliable (NVIDIA bins the better-functioning processors from a batch into its higher-end cards), whereas the 780 Ti is aimed first and foremost at less critical applications.

  2. Peer-to-peer is a significant point. The Titan Z has a 16-lane PCIe switch on board between its two GPUs, so it effectively adds an extra PCIe slot to your motherboard with no loss. If your motherboard has two spare double-wide PCIe slots, each with a dedicated 16 lanes, and they are peer-enabled, then the 780 Ti is probably the winner. If, on the other hand, they are not peer-enabled, or one of them has only 4 or 8 lanes, then I would go with the Titan Z. It also depends on the application, though - particularly on how much inter-GPU communication is required. You can check whether your slots are peer-enabled with the sketch below.
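The check itself is a one-liner in the runtime API - a minimal sketch, nothing here is specific to any particular board:

```
#include <cuda_runtime.h>
#include <cstdio>

int main()
{
    int canAccess01 = 0, canAccess10 = 0;

    // Ask the driver whether device 0 can read/write device 1's
    // memory directly over PCIe, and vice versa.
    cudaDeviceCanAccessPeer(&canAccess01, 0, 1);
    cudaDeviceCanAccessPeer(&canAccess10, 1, 0);

    printf("GPU0 -> GPU1 peer access: %s\n", canAccess01 ? "yes" : "no");
    printf("GPU1 -> GPU0 peer access: %s\n", canAccess10 ? "yes" : "no");
    return 0;
}
```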

Regarding the “server only” question:
Nearly all K cards (with the exception of the K20, I think) are PASSIVELY cooled. This is VERY important to note - a desktop simply can’t move air through a K card quickly enough, so it overheats and gets damaged (and I’m not talking months here - the timeframe is minutes to hours). Another important difference (which I’ve personally struggled with recently on consumer cards) is that K cards have their power sockets on the rear end of the card, whereas consumer cards have the power socket on the side, which can really clutter the space around the motherboard.

Hi sBc-Random,

Thanks for answering.

Just to make sure I understand correctly - are you saying that the communication between the two GPUs on the Titan Z goes over its internal PCIe switch, and that this is slower than going through the motherboard? Is there no kind of faster direct communication?

BTW, we have the ASUS P9X79E WS motherboard ([url]http://www.asus.com/Motherboards/P9X79E_WS[/url])

The motherboard review on AnandTech ([url]http://www.anandtech.com/show/7613/asus-p9x79e-ws-review[/url]) provides a diagram of the PCIe lane layout if that helps.

No - it will be exactly the same speed as a top-of-the-line motherboard connection. There’s no proprietary link with greatly improved transfer speed, but it will certainly be as good as you can get between two GPUs.

That motherboard would do nicely for the 780s: they would run at, let’s say, 95% of the Titan Z’s inter-GPU speed for much, much less money, so I would say go with the 780s. Mind you, if the BIOS supports it, you could POTENTIALLY fit in 4 Titan Zs - that’s 8 GPUs, as opposed to 4 with the 780s. It really depends on how many cards you need (I have 16 :P)
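If you want to sanity-check the inter-GPU bandwidth on your own board, here is a rough timing sketch using the standard events API (the device numbering and the 256 MB size are arbitrary):

```
#include <cuda_runtime.h>
#include <cstdio>

int main()
{
    const size_t bytes = 256 << 20;        // 256 MB test transfer
    float *d0, *d1;
    cudaEvent_t start, stop;
    cudaStream_t s;

    cudaSetDevice(1);
    cudaMalloc(&d1, bytes);
    cudaDeviceEnablePeerAccess(0, 0);      // flags argument must be 0

    cudaSetDevice(0);
    cudaMalloc(&d0, bytes);
    cudaDeviceEnablePeerAccess(1, 0);
    cudaStreamCreate(&s);
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    // Time a direct GPU0 -> GPU1 copy.
    cudaEventRecord(start, s);
    cudaMemcpyPeerAsync(d1, 1, d0, 0, bytes, s);
    cudaEventRecord(stop, s);
    cudaEventSynchronize(stop);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, start, stop);
    printf("GPU0 -> GPU1: %.1f GB/s\n", (bytes / 1.0e9) / (ms / 1.0e3));
    return 0;
}
```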

Hi sBc-Random,

Much appreciated. We only need 2 cards for the two types of images, and maybe a third one just for display.

Just to confirm - copying from one GPU to the other is staged through CPU memory in either case, right?

No - that’s exactly the point of peer-to-peer.

Enabling peer-to-peer copies means that instead of GPU0 > CPU > GPU1, the transfer goes directly GPU0 > GPU1. Roughly 50% faster in my experience, but it can vary…
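In code it’s just two extra calls before the copy - a minimal sketch (devices 0 and 1 and the buffer size are arbitrary):

```
#include <cuda_runtime.h>

int main()
{
    const size_t bytes = 32 << 20;       // say, a 32 MB frame buffer
    float *d_src, *d_dst;

    cudaSetDevice(0);
    cudaMalloc(&d_src, bytes);
    cudaDeviceEnablePeerAccess(1, 0);    // allow GPU 0 to access GPU 1

    cudaSetDevice(1);
    cudaMalloc(&d_dst, bytes);
    cudaDeviceEnablePeerAccess(0, 0);    // and the other direction

    // With peer access enabled this goes straight over PCIe (GPU0 > GPU1);
    // without the two calls above the same cudaMemcpyPeer still works,
    // but the runtime stages it through host memory (GPU0 > CPU > GPU1).
    cudaMemcpyPeer(d_dst, 1, d_src, 0, bytes);
    return 0;
}
```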

Four Titan Zs at roughly 375 W apiece - that’s 1500 W just for the GPUs. You’ll probably need a 2-2.5 kW PSU. The most powerful retail PSU is 1.6 kW, and combining multiple PSUs is hard and/or dangerous, I think.

I’d like to know more about your setup!

By the way, this person reports having 18 GPUs in a single rig (“18 GPUs in a single rig and it works” - CUDA Setup and Installation - NVIDIA Developer Forums), although it looks like the PCIe bandwidth is pretty bad.