Optix 6.5 - Multi-GPU

sukumar.srikanth · April 3, 2020, 9:11am

I want to move my OptiX application, which currently runs on version 6.5, to a multi-GPU setup.
From the documentation I inferred that OptiX uses all available GPUs by default. It does, but performance is slower than with a single GPU. I have some questions about that. I have quite a number of buffers marked as RT_BUFFER_OUTPUT (roughly 10 or so). What happens to these buffers in a multi-GPU setup? Is there a copy of each of them on every GPU, with a sync step after the computation is done? Or do all the buffers reside on the host, with the data computed and transferred via PCIe? Does the same happen for RT_BUFFER_INPUT?

droettger · April 3, 2020, 9:57am

Please have a look into the following threads about multi-GPU topics on OptiX 6 and earlier:
https://forums.developer.nvidia.com/t/cuda-optix-gpu-utilisation/58621
https://forums.developer.nvidia.com/t/multi-gpu/40472
https://forums.developer.nvidia.com/t/question-about-handling-buffers-when-using-multiple-gpus/54011
https://forums.developer.nvidia.com/t/very-poor-multi-gpu-scaling-on-dgx-1/67139
https://forums.developer.nvidia.com/t/createbufferfromglbo-function-crash-in-multi-gpu-environment/62060/4
Look for “pinned memory” and RT_BUFFER_GPU_LOCAL inside these explanations.

There are also topics inside the OptiX 6.5.0 programming guide touching multi-GPU:
https://raytracing-docs.nvidia.com/optix6/guide_6_5/index.html#cuda#interoperability-with-cuda
https://raytracing-docs.nvidia.com/optix6/guide_6_5/index.html#performance#performance-guidelines

That said, with OptiX 7 you would have explicit control over any multi-GPU behavior because OptiX 7 itself knows nothing about multiple devices. That part is completely handled by the CUDA host code you control!

The OptiX 7 applications linked here contain one example which shows different methods to distribute the rendering workload of one frame over multiple GPUs:
https://forums.developer.nvidia.com/t/optix-advanced-samples-on-github/48410/4
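
To give a rough idea of what "distributing one frame over multiple GPUs" can mean on the host side, here is a minimal sketch that splits the image rows into one contiguous block per device. This is only an illustration and not necessarily one of the methods used in the linked examples. The names RowRange and splitRows are hypothetical, and the device count is assumed to come from your own CUDA host code. No OptiX or CUDA calls are involved, it is only the partitioning arithmetic.

```cpp
#include <cassert>
#include <vector>

// Half-open range of image rows [begin, end) assigned to one device.
// Hypothetical helper type, not part of OptiX or CUDA.
struct RowRange {
    int begin;
    int end;
};

// Split `height` image rows into `deviceCount` contiguous ranges.
// Range sizes differ by at most one row; the first (height % deviceCount)
// devices receive the extra row.
std::vector<RowRange> splitRows(int height, int deviceCount) {
    assert(height >= 0);
    assert(deviceCount > 0);

    const int base  = height / deviceCount;  // rows every device gets
    const int extra = height % deviceCount;  // leftover rows to spread out

    std::vector<RowRange> ranges;
    ranges.reserve(static_cast<std::size_t>(deviceCount));

    int begin = 0;
    for (int d = 0; d < deviceCount; ++d) {
        const int rows = base + (d < extra ? 1 : 0);
        ranges.push_back(RowRange{begin, begin + rows});
        begin += rows;
    }
    return ranges;
}
```

With splitRows(10, 3) the three devices would get rows [0, 4), [4, 7) and [7, 10). Each device would then render only its own range into a device-local buffer, and the host code would combine the partial results afterwards. Contiguous blocks are the simplest choice but can balance poorly when one part of the image is much more expensive than the rest; interleaved tiles are a common alternative.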
