How to Build a GPU-Accelerated Research Cluster

jwitsoe · July 30, 2013, 1:56am

Originally published at: https://developer.nvidia.com/blog/how-build-gpu-accelerated-research-cluster/

Some of the fastest computers in the world are cluster computers. A cluster is a computer system comprising two or more computers (“nodes”) connected with a high-speed network. Cluster computers can achieve higher availability, reliability, and scalability than is possible with an individual computer. With the increasing adoption of GPUs in high performance computing (HPC), NVIDIA GPUs…

anon43358632 · December 16, 2013, 3:06am

hi, myself medha, Ph.D student

Working in area of publish subscribe distributed system . I am interested in building GPU accelerated research cluster for my research in the area of design of high performance pub/sub using MPI and CUDA. Can u give specification of infrastructure like node or GPU for purchase. Also I wanted to discuss with u my research area .can u help?

thanks for ur valuable post.

anon2495774 · December 26, 2013, 3:52pm

Thanks for your interest in building a research cluster. The basic inputs about choosing the Nodes (Workstation or server) and GPUs are given in point 1 of my blog above. You can choose either to buy any standard OEM machine or assemble any machine which fulfills the specs given. Please let me know if you have any specific questions about choosing the hardware, I will be happy to answer. Please let me know about your research area and points of discussion, I would be happy to discuss more on that.

anon43358632 · December 30, 2013, 6:13am

thanks for the reply and the interest shown. I am working in the area of publish subscribe system where publishers publishes the work and subscriber subscribes the things of his interest. EXample is stock trading, where subscriber can subscribe to any stock when some conditions satisfies. Matching of subscriptions with publications is called matching algorithm . I am trying to port this pub/sub system on HPC platform, I want to perform hybrid parallelism by using MPI and CUDA . My idea is one node will do the task of clustering and send the subscriptions according to clusters formed to individual work stations. Every work node will have cuda card. Matching will be done by GPGPU. As the publications arrived , the node who does the clustering will approximately choose the node where subscriptions can be found. If this cluster is formed then I can check about latency bandwidth , MPI communications bandwidth etc,
Now my questions are:-

No one has done the porting of pub/sub system on MPI and CUDA.yet. I haven't found any IEEE paper on it. can I go with this idea of forming research cluster and deploying pub/sub system on that? or my concept is itself wrong?
I am pursuing Ph.D and my work is to make pub/sub system parallel and scalable by using HPC.
I have implemented CUDA content matching algorithm and results are promising. Now I want to make it distributed with combination of MPI and cuda.
Also I want to test this system on hadoop and storm which is event processing system. and then conclude about which architecture is suitable for pub/sub system
Pls guide me regarding this. Thanks for everything.

medha

anon2495774 · January 9, 2014, 5:10am

Please drop an email to CUDA-Technology-IN@nvidia.com, we can discuss in detail on that about your research work.

anon93161819 · July 3, 2014, 8:07am

Will this cluster provide any acceleration for molecular dynamics (or docking) software (Amber, schrodinger, MOE etc)

anon2495774 · August 26, 2014, 4:47pm

If your application is getting better performance with GPUs and also scales well across nodes, cluster can help you in getting a good acceleration

anon91479884 · June 4, 2015, 4:28pm

I wanna build a low cost GPU +CPU cluster , im very much confused in selecting the right board . can any help?

anon53305455 · August 6, 2015, 7:18pm

Hi, Hung from Hong Kong.
Teacher in a middle school.

I find the link has been removed. Can you tell from where I can watch your video record and slide for your talk?

Thanks.

E-mail
schrodingeriap@yahoo.com.hk

anon2495774 · August 7, 2015, 12:48am

Please see recording - search at http://on-demand-gtc.gputec...

Search GTC, 2013 and with Title, it will take you a page that will show this talk and will have recording link.

Slides are at http://on-demand.gputechcon...

anon53305455 · August 8, 2015, 12:45am

Get it.
Thanks for your kindness.

Chun Hung

------------------------------
2015年8月7日週五中國標準時間上午8:49 Disqus 的來信﹕

anon78252278 · October 31, 2015, 2:17pm

Can I use GeForce GTX card instead of Tesla

anon74519798 · November 20, 2015, 11:15am

Sir,
I Karishma Bansole.I am doing Mtech.My dessertation work in Parallel computing.I need to establish MPI-GPU Cluster.
Uptill now I have made rock cluster of one node.Now I want to add Cuda roll on rock cluster How should I do? And I dont have infiband.So I would like to know how to established MPI-GPU cluster without infiband?.Or Infiniband is needed for making the MPI-GPU Cluster

anon2495774 · December 10, 2015, 1:45am

Could you please email to CUDA-Technology-IN@nvidia.com about your requirements. We will get back to you.

anon2269034 · December 11, 2015, 8:42am

Multiple GPU's on a single node (Ex: 4-in-one/8-in-one)
OR
one/two GPU per node.
What is the trade-off? Where does it actually make a difference?

anon24913107 · May 23, 2016, 8:59pm

Hi Pradeep,
Any thoughts on how to install a computing cluster for Matlab distributed computing?

Topic		Replies	Views
Fast Multi-GPU collectives with NCCL Technical Blog	14	980	May 11, 2018
An Even Easier Introduction to CUDA Technical Blog	141	6241	November 28, 2023
advice needed by a PhD student CUDA Programming and Performance	26	2854	December 4, 2011
When to use Serial CPU, CUDA, OpenMP and MPI? CUDA Programming and Performance	8	13407	May 29, 2021
HPLinpack for CUDA Any interest? CUDA Programming and Performance	27	11950	May 10, 2012
CUDA 4.0 CUDA Programming and Performance	63	507396	March 28, 2013
GPU Pro Tip: CUDA 7 Streams Simplify Concurrency Technical Blog	51	2096	February 5, 2020
Benchmarking GPUDirect RDMA on Modern Server Platforms Technical Blog	40	2707	April 11, 2019
Boosting Inline Packet Processing Using DPDK and GPUdev with GPUs Technical Blog	17	1857	June 26, 2023
CUDA/OpenCL runs multiple GPUs sequentially CUDA Programming and Performance	16	19323	November 26, 2015

How to Build a GPU-Accelerated Research Cluster

Related topics