OptiX 7.0: Can I use two streams for two optixLaunch operations in two threads to improve performance?

zhoubosheng · October 10, 2020, 9:25am

In my scene, I need to render two passes using the same meshes. Could I use two threads, one for each pass?

droettger · October 12, 2020, 8:24am

The optixLaunch calls are asynchronous since they have their own stream argument.
Even multiple optixLaunch calls from different threads with different streams on the same CUDA context are possible, but currently that specific case requires separate OptixPipeline arguments. This is explained here:
https://raytracing-docs.nvidia.com/optix7/api/html/group__optix__host__api__launches.html#ga089e2a00833cb952276c5d6e09b692da

EDIT: Thinking about that, I actually wouldn’t expect parallel launches of the same pipeline on different streams to work. It would break the launch parameter contents, which get copied to constant memory and can’t be changed when using the same pipeline. That means: always use different pipelines when launching to multiple streams in parallel on the same CUDA context, irrespective of doing this from a single or multiple host threads, until future OptiX versions say otherwise.

If your render passes use different pipelines anyway, that should just work.
But if the two passes depend on each other in some way, e.g. by accumulating into the same result buffers, then using multiple streams doesn’t make sense.

Also note that the default stream zero has different synchronization semantics. That means you need to use explicitly created CUDA streams for these parallel asynchronous launches. You can also create the CUDA context so that secondary streams are not synchronized against the default stream zero.

Whether that actually shows any performance benefit depends on the workload and the underlying hardware.
I can easily load a GPU to almost 100% with a single thread and single stream when using OptiX 7 for big enough workloads, at which point a second stream on the same GPU wouldn’t make anything faster.
Your mileage may vary.
