MPI (MPICH2) and GTX580 using MPI with CUDA

I’m afraid I am probably not going to like the answer to this but…

We are developing a model which uses CUDA kernels to accelerate certain calculations. Eventually we will need to scale this to run across a large number of EC2 GPU compute cluster instances, but for now I am trying to use a two-node (three-GPU) Windows-based cluster as a test bed for the code. One machine (running Server 2008 R2) has a GTX 570 plus a Tesla C2050 (in TCC mode), and the other machine (Windows 7) has a 3 GB GTX 580. We can run a simple Hello World exe (i.e. no CUDA) across both hosts using mpiexec -hosts 2 server 5 win7 5 \someshare\mpiHW.exe without a hitch. However, trying to run the simpleMPI example from the SDK results in an error 38 on the Windows 7 host - i.e. it does not recognise the GTX 580 (the code itself runs on the Windows 7 machine without a hitch, just not via MPI).
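For what it's worth, error 38 is cudaErrorNoDevice - the process cannot see any CUDA-capable GPU at all. A minimal check like the sketch below (my own, not from the simpleMPI sample) can be launched via mpiexec on each host to confirm what each MPI-spawned process actually sees:

```cuda
// Minimal device-visibility check; run it under mpiexec on each host.
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int n = 0;
    cudaError_t err = cudaGetDeviceCount(&n);
    if (err != cudaSuccess) {
        // err == 38 is cudaErrorNoDevice: no CUDA GPU visible to this process
        printf("cudaGetDeviceCount failed: %d (%s)\n",
               (int)err, cudaGetErrorString(err));
        return 1;
    }
    for (int i = 0; i < n; ++i) {
        cudaDeviceProp p;
        cudaGetDeviceProperties(&p, i);
        printf("device %d: %s\n", i, p.name);
    }
    return 0;
}
```

If this fails under mpiexec but succeeds when run locally, the problem is the session/driver context the MPI process runs in, not the code.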

So… has NVIDIA restricted MPI to the Tesla TCC driver? If so, I will seriously have to consider whether I should be developing code for this platform!

Hopefully someone will confirm that it is more likely an error in our MPI setup, and that they have this working with GeForce cards across multiple nodes - here’s hoping.

It is not an MPI problem, it is related to the WDDM driver model in Win 7.
When you are trying to use a remote connection, you will not be able to access the GPU unless you use TCC mode (or go back to XP…) or do something similar to what is explained in this post.

It will work just fine in Linux.

Thanks for your help. That makes sense.

As we will be using Linux GPU clusters on EC2, I guess the simplest solution would be to switch to Linux for this test cluster (I have no real preference other than that I like to use the VS2010 IDE). I’ll check out the thread you suggest and then decide.

We use MPICH2 (version 1.4.1p1) for our Linux GPU cluster (4 PCs with 4x GTX 580 each) and it works like a charm.

FYI: If you use MVAPICH2 1.8rc1, you can MPI_Send/MPI_Recv directly from device pointers. And when the send/recv is to/from devices on the same host, it will automatically use IPC to transfer data directly from GPU to GPU without involving the host. This can provide significant performance gains.
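In case it helps, here is a rough sketch of what that looks like. It assumes MVAPICH2 built with CUDA support and GPU support enabled at run time (via the MV2_USE_CUDA=1 environment variable); the device pointer is handed straight to MPI, with no explicit cudaMemcpy staging in user code:

```cuda
// Sketch: device-pointer MPI_Send/MPI_Recv with a CUDA-aware MPI
// (e.g. MVAPICH2 1.8rc1 run with MV2_USE_CUDA=1).
#include <mpi.h>
#include <cuda_runtime.h>

int main(int argc, char** argv) {
    MPI_Init(&argc, &argv);
    int rank;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    const int N = 1 << 20;
    float* d_buf;                       // device memory, not host memory
    cudaMalloc(&d_buf, N * sizeof(float));

    if (rank == 0) {
        // Device pointer passed directly; the MPI library stages it, or
        // uses CUDA IPC when the peer rank's GPU is on the same host.
        MPI_Send(d_buf, N, MPI_FLOAT, 1, 0, MPI_COMM_WORLD);
    } else if (rank == 1) {
        MPI_Recv(d_buf, N, MPI_FLOAT, 0, 0, MPI_COMM_WORLD,
                 MPI_STATUS_IGNORE);
    }

    cudaFree(d_buf);
    MPI_Finalize();
    return 0;
}
```

With a plain (non-CUDA-aware) MPI you would instead have to cudaMemcpy to a host buffer, send that, and copy back to the device on the receiving side.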

Thanks for confirming this will work with linux. Actually I have just been given the opportunity to purchase a couple of M2050s going cheap - I guess these will work using the TCC mode without resorting to a linux OS installation. As we need to run some simulations in a hurry for a grant proposal I will probably forego that pleasure for now if at all possible ;-)