Profiling MPI+Cuda

I’m developing a MPI+cuda project and I’ve tried to profile my app with nvvp and nvprof but, in both cases it doesn’t give a profile. The app works completely fine, but no profile is generated.

======== Warning: No CUDA application was profiled, exiting

I tried with simpleMPI cuda example with the same result.
I’m using CUDA 5.0 in a 580 GTX and openMPI 1.7.3 (featured, not release yet because I’m testing the CUDA-aware option)

Any ideas? Thank you very much.

It was already solved in stackoverflow
Thank you!