GPU clusters for HPC

xubair · September 29, 2011, 12:26pm

HELLO,
I wish to work on GPU clusters for HPC. Please suggest which type of GPUs should I use for making cluster?
From where should I start? Please help…

tementy · September 29, 2011, 3:13pm

NVidia provides Tesla solutions in 1U boxes with interconnect cables and host cards: http://origin-www.nvidia.com/object/tesla_computing_solutions.html
You can build your onw (simple) cluster using PCs with multiple-PCIE motherboards and GeForce/Quadro/Tesla cards. Tesla and Quadro solutions are more expensive, but they have such advantages as ECC-powered serious memory amounts and full-speed double precision computations (GeForce cards have less double-precision capabilities).
The next step is software. You can prefer Unix-like operation systems (Linux, for example). I’m not familiar to cluster building (yet External Image ) and I can only advice you to read this google answer, that provides useful links: Google Answers: How to build a Linux cluster

mfatica · September 29, 2011, 3:29pm

For an HPC cluster, Tesla GPUs with ECC support and full double precision are the right choice.

All the major OEMs and several smaller companies offer 1U/2U servers with Tesla inside and even preconfigured clusters.

xubair · September 30, 2011, 8:24am

Thanks all for reply. Actually I want to know that how can I run two GPU’s in parallel with host CPU that divide workloads b/w those GPU’s depending on the application requirement… I want to work on LINUX Environment…which softwares do I need to use???..I mean CUDA will be use to handle single GPU???..what do to if to make GPU work in parallel??? From where should I start after purchasing Graphic cards…???

tementy · October 1, 2011, 1:52am

The only way to make CUDA GPUs efficiently work together is manually distribute load between them. This will be task-specific approach. CUDA programming guide can help you - there is a section about using multiple devices. SLI technology is for graphics only, not for GPGPU.

Softwares you’ll need: NVidia Driver, CUDA Toolkit, and (optionally, but required for concepts learning) GPU Computing SDK Code Samples. The list of links is allocated on cuda download page: http://developer.nvidia.com/cuda-toolkit-40
First, choose operating system supported by NVidia SDK, and install it. The next step will be driver installation - you can prefer DevDriver from CUDA Download page (it’s not required, but guarantees compatibility with toolkit).

If you prefer Linux, I can advice you something RPM-based, like Fedora (user-friendly External Image ). One year ago i had problems installing driver on Ubuntu.

Sarnath · October 1, 2011, 2:19pm

Whatever GPU you use, do “not” use an Operating System:

That which occupies more than 1/2 of System RAM just to sustain itself
That which is busy drawing rectangles and squares on screen than listening to what its user wants
That which stands like a 500 pound guerilla between you and GPU performance.

KenjiK · February 22, 2012, 2:36am

Hi everyone,

Sorry, I’m a newbie but looking at this thread it seems to imply that it is possible to cluster GeForce cards for some HPC gain? My CUDA application works fine with single precision on my GeForce GTX 460M and I’m pretty sure most of my processing bottleneck is just a need for more threads (it’s a relatively low memory program). So I’m thinking theoretically I could throw like, 10 GeForce cards at my program and see some nice speedup? This is assuming I can find a suitible chassis, enough CPU cores, PCIe slots, etc. I was just wondering if there was any sort of inherent property of the GeForce series that puts a limit as to how many GeForce cards I can cluster. Thank you!

Topic		Replies	Views
Is GeForce HPC possible? CUDA Programming and Performance	3	5934	February 27, 2012
Developing CUDA applications to a cluster. Teaching and Curriculum Support	0	1108	April 10, 2014
Setting up a GPU Cluster CUDA Programming and Performance	2	1546	September 1, 2018
How to Build a GPU-Accelerated Research Cluster Technical Blog	15	747	May 23, 2016
advice needed by a PhD student CUDA Programming and Performance	26	2854	December 4, 2011
Basic Hardware & CUDA Configuration Question System Setup CUDA Programming and Performance	1	2367	May 11, 2008
HPL2 for Nvidia Tesla Platform CUDA Programming and Performance	12	2897	January 11, 2016
GPU based cluster CUDA Programming and Performance	2	719	November 25, 2015
Tesla K20 and GeForce in one machine possible? CUDA Setup and Installation	10	7626	September 26, 2022
New System Question CUDA Programming and Performance	6	6136	December 4, 2007

GPU clusters for HPC

Related topics