MPI and CUDA C

NAMD · December 4, 2009, 7:04pm

Hi I am planning to use tera grid cluster( http://www.ncsa.illinois.edu/UserInfo/Reso...64TeslaCluster) to run my Monte Carlo fortran codes,with CUDA C,using fortran wrapper.I am planning to run it on multiple nodes,each node having some tesla cards.MPI is used to communicate between processors.But I am confused that,when I run my code,each processor will need a GPU to run it’s part of the CUDA C code,hence each might compete for a particular Tesla card.How can I have control over this competition.Is there any way to do that?

cgorac · December 4, 2009, 8:08pm

In your MPI machine file, specify single process to be run per machine with GPU card attached?

NAMD · December 4, 2009, 9:52pm

Can you please provide a sample file with the instruction.Thanks a lot for your answer

cgorac · December 5, 2009, 9:37am

I can’t, as I have no clue either about the MPI installation type (MPICH/OpenMPI/…?) on this cluster, nor about cluster nodes/network configuration. But you should be able to find about machine files (and how to point to one, when starting your MPI program) in your MPI installation documentation, and you should then talk with sysadmin(s) of this particular installation about nodes available for your work, their configuration etc., so that afterward you should be able to prepare machine file.

Principally, machine file is simply a list of nodes, one machine listed per line of this file, on which to run your MPI program; so my initial suggestion was to just put each machine with a GPU attached into this file once, as this way each process launched will know it has exclusive access to the GPU on the corresponding machine; this is probably too simplistic for what you eventually intend to do, but at least it should be something to start with.

Side note: I think you should not crosspost your questions.

NAMD · December 5, 2009, 5:24pm

Thank you very much.I am new to the forum,I will not crosspost again.Thank you.

gshi · December 5, 2009, 5:53pm

Your link is broken

I assume you are going to run on NCSA’s lincoln cluster (http://www.ncsa.illinois.edu/UserInfo/Resources/Hardware/Intel64TeslaCluster/)

On that machine, cpu core: gpu =8:2 for each node.

If each of your MPI process uses one GPU, then you probably want to run two MPI processes on each node.

-gshi

NAMD · December 5, 2009, 5:57pm

Yes I am using the same cluster,which you have stated.I will try out your suggestion.Thanks a lot.Can you provide me some script file to run on Lincoln cluster with MPI and Cuda C.

NAMD · December 6, 2009, 8:31pm

Do I need to include some logic in my code ,like cudasetdevice() as well so that there will be one to one mapping with processor and GPU??

Praveen_PVS · December 7, 2009, 5:07am

Hello

Yes, you have to include some logic inside the code so that it would be easy to map.

It can be like this:

If there is a card connected to that processor ( this information can be got from cudagetdeviceproperties and other run time routines), the job can be submitted to GPU. If it is not there it will be run by the processor itself or if they are more than one card connected to that machine, then by cudasetdevice routine we can map each processor to one particular card which gives us better results.

we have see how many gpu’s are there on that machine, if the machine is quad core but only one card is connected to that machine, then it wont be much useful to us. Because we can run only one kernel on the card at a time, so all kernels submitted by each processor will be in queue which will not give us good results.

Topic		Replies	Views
Using MPI with CUDA C CUDA Programming and Performance	2	1328	December 5, 2009
CUDA and MPI Cluster Computing Implementation. Need advice for setting up MPI and CUDA over a cluste CUDA Programming and Performance	2	2478	February 19, 2010
CUDA Cluster Programming Any1 Experienced? CUDA Programming and Performance	12	7047	December 5, 2008
Using MPI+multi-GPUs with CUDA 4.0 CUDA Programming and Performance	5	778	June 9, 2011
PVM codes CUDA Programming and Performance	12	4581	March 31, 2010
Mutual exclusion MPI Windows CUDA Programming and Performance	6	3533	May 18, 2011
How to run these sample multi-gpu programs CUDA Programming and Performance	6	307	July 18, 2024
Mixed Programing combining MPI and CUDA CUDA Programming and Performance	19	6454	May 5, 2009
Question about CUDA+MPI Legacy PGI Compilers	3	2627	March 13, 2018
Parallelize across CPU and GPU cores simultaneously Legacy PGI Compilers	3	5220	January 6, 2016

MPI and CUDA C

Related topics