In multi-GPU implementations, are there any well-known algorithms in general use?

I would like to know which well-known algorithms are used or available for parallel programming in a multi-GPU environment.

These are not algorithms, but both OpenMP and MPI are commonly used to distribute work among multiple processors, both in the CPU space and the GPU space.
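To make "distributing work" a bit more concrete, here is a minimal sketch of the block decomposition plus reduction pattern that OpenMP and MPI programs typically follow: split N items into one contiguous chunk per worker, let each worker process its chunk, then combine the partial results. Everything here is an assumption for illustration: the function names are hypothetical, and plain CPU threads stand in for the GPUs. In a real multi-GPU program each worker would first select its own device (for example with `cudaSetDevice`) and launch a kernel on its chunk, and with MPI the final combine step would usually be something like `MPI_Reduce`.

```python
from concurrent.futures import ThreadPoolExecutor


def block_range(n_items, n_workers, rank):
    """Return the half-open [start, stop) range owned by worker `rank`.

    The first (n_items % n_workers) workers get one extra item, so chunk
    sizes differ by at most one.
    """
    base, extra = divmod(n_items, n_workers)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return start, stop


def partial_sum(data, start, stop):
    # Stand-in for the per-GPU work (hypothetical); here it just sums on the CPU.
    return sum(data[start:stop])


def distributed_sum(data, n_workers):
    # One chunk per worker, mirroring one host thread (or MPI rank) per GPU.
    ranges = [block_range(len(data), n_workers, r) for r in range(n_workers)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(lambda r: partial_sum(data, *r), ranges))
    # Reduction step: combine the per-worker partial results.
    return sum(partials)


if __name__ == "__main__":
    print(distributed_sum(list(range(100)), 4))
```

The same split-compute-combine structure applies whether the workers are OpenMP threads on one node or MPI ranks across several nodes; only the mechanism for moving data and combining results changes.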

Thanks… I will look into this further.