MPI+CUDA mixed programming - Error

I’m using mixed MPI+CUDA programming to run matrix multiplication on a GPU cluster. When I offload the multiplication to the GPUs through MPI and CUDA, I get the following error message at run time:

FATAL: Error inserting nvidia (/lib/modules/3.2.0-23-generic-pae/kernel/drivers/video/nvidia.ko): No such device

Please note that this happens only when I use MPI together with CUDA. The CUDA-only version works fine. Thanks in advance.

I am working on a similar problem. Did you find the solution to your problem?