I have a problem concerning scalability on NVIDIA platforms using OpenCL.
On the Khronos OpenCL forum, I was told that nothing is wrong from the point of view of the OpenCL spec (OpenCL KHRONOS Forum): http://www.khronos.org/message_boards/viewtopic.php?f=28&t=3831&p=11489#p11489
In a multi-GPU environment, I create one command queue per device. My expectation is that the execution of these command queues is more or less interleaved. However, on NVIDIA, they execute sequentially, one after the other. Is this a known issue, or is it a mistake on my side?
I tested on a Tesla S1070 system, as well as on 2 x GeForce 8800 GTX and 2 x GeForce 9800 GX2. None of these systems shows any scaling.
If you need further details, I will provide them!
Thanks for your help!