how to run two __global__ funtions simultaneously in two CUDA devices

Hi,all.
i’m new in CUDA programming. As i have two GTX560 cards, i want to know how to run two global funtions simultaneously in two desktop GeForce CUDA-suppoted cards respectively,
or can i run these global functions sequentially in one card ?

thanks,
best regards