Multiple Device Execution

Can anyone point me to a tutorial or some reading material/code examples where i can execute 1 kernel on 2 different devices simultaneously?

simpleMultiGPU - CUDA SDK

Thanks :D

GPUworker. Very good and structured solution

http://forums.nvidia.com/index.php?showtopic=66598