MPI+CUDA Testing the capabilities of different cards

Hi, all.
I have to machines with 2 cards in each. One has Geforce 8800 GTX and another has GeForce GTX 480.
Is there any MPI+CUDA benchmark? Which MPI do you use when you develop applications with CUDA support?