I currently have a program that uses CUDA to generate an image.
This image is rendered using opengl interoperability (using a pbo)
I want to move this to a dual GPU setting, where half the image is calculated by each GPU
(in general, we would have small parts of the image computed by several gpus)
I want to know the fastest way of displaying these ‘pieces’ of the image as one full image, using OpenGL.
Specifically, I am wondering if it is possible to somehow create one opengl context and one cuda context on each GPU, and render the partial images together on a single window.
Also, if I use a single OpenGL context, is it possible to use opengl interop where opengl context is on one gpu and the cuda program is running on a different GPU??
(any other possible way of doing this is welcome)