This is a pretty random question, but here it goes…
I am attempting to integrate mpi code into my CUDA enabled program. The goal is to offload background processes to a separate box that I have (2x Tesla cards) rather than trying to fit more GPU cores into my current box (3x 280). The internal gigabit network is set up and I can execute simple openmpi programs (hello world -ish) across the network without any problems. When I try to execute my full CUDA utilizing program, I get the following error.
program: error while loading shared libraries: libcudart.so.2: cannot open shared object file: No such file or directory
The program itself can run on each machine without any problems (i.e. mpirun -np 1 program). I only run into trouble with (mpirun -np 2 -machinefile hostnames program) Any thoughts or suggestions?
My setup
Computer 1
3x 280
Computer 2
9500 + 2x Tesla
Both computers have:
quad core AMD
8GB RAM
Ubuntu 8.04