Dynamic Parallelism in OptiX?

m_sch · January 8, 2014, 3:08pm

Hello,

Are there any plans to support Dynamic Parallelism in OptiX?

It would be nice to launch multiple rays in parallel from e.g. a hit program.

HamzaC · January 14, 2014, 4:36pm

I would also appreciate such a feature!
I guess that would be possible only on Kepler architectures.

blloyd · January 15, 2014, 5:24pm

Do you have examples of use cases for such a feature? What benefits do you expect to get from it?

HamzaC · January 16, 2014, 9:26am

I’m working on ultrasonic wave propagation simulation. In the anisotropic case, each ray intersection with a surface generates up to six new rays in six different directions.
Being able to launch these rays in parallel would save me from tracing them sequentially.

Another case in which dynamic parallelism would really help is when I cast rays uniformly over a sphere to find out which directions reach a given surface. I need to be able to refine the sampling around the directions that hit, to get high precision in those areas without wasting computation time on the others.
Since the new kernels I need to launch depend on large amounts of data that reside in GPU memory, I think it would be really helpful to launch these kernels directly from the GPU without having to handle the data on the CPU.
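
[Editor's sketch] The refinement idea described above can be illustrated on the CPU with a work queue. This is a minimal sketch under stated assumptions, not HamzaC's code and not an OptiX API: `Cell`, `reachesTarget`, and `refine` are hypothetical names, the "scene" is a made-up spherical cap (u > 0.5), and each cell is tested with a single ray through its center. Cells are rectangles in (cos θ, φ), which cover equal solid angles, so the coarse grid samples the sphere uniformly.

```cpp
#include <cstddef>
#include <deque>
#include <vector>

// A patch of the unit sphere in (u = cos(theta), phi) coordinates.
// Equal-sized rectangles in this parameterization cover equal solid angles.
struct Cell {
    double u0, u1;      // range of cos(theta)
    double phi0, phi1;  // range of azimuth in radians
    int depth;          // refinement level, 0 = coarse grid
};

// HYPOTHETICAL stand-in for tracing one ray: the "target" is the cap u > 0.5.
// In a real application this would be a ray cast against the scene.
bool reachesTarget(double u, double /*phi*/) { return u > 0.5; }

struct RefineResult {
    std::vector<Cell> hits;  // finest-level cells whose center ray hit
    std::size_t traced = 0;  // total number of rays traced
};

// Start from an nu x nphi coarse grid and subdivide only the cells that hit.
RefineResult refine(int nu, int nphi, int maxDepth) {
    const double kTwoPi = 6.283185307179586;
    std::deque<Cell> work;
    for (int i = 0; i < nu; ++i)
        for (int j = 0; j < nphi; ++j)
            work.push_back({-1.0 + 2.0 * i / nu, -1.0 + 2.0 * (i + 1) / nu,
                            kTwoPi * j / nphi, kTwoPi * (j + 1) / nphi, 0});

    RefineResult out;
    while (!work.empty()) {
        Cell c = work.front();
        work.pop_front();
        const double um = 0.5 * (c.u0 + c.u1);
        const double pm = 0.5 * (c.phi0 + c.phi1);
        ++out.traced;
        if (!reachesTarget(um, pm)) continue;  // miss: spend no more rays here
        if (c.depth == maxDepth) {             // finest level reached: record it
            out.hits.push_back(c);
            continue;
        }
        // Hit: split the cell into four children and queue them for tracing.
        work.push_back({c.u0, um, c.phi0, pm, c.depth + 1});
        work.push_back({c.u0, um, pm, c.phi1, c.depth + 1});
        work.push_back({um, c.u1, c.phi0, pm, c.depth + 1});
        work.push_back({um, c.u1, pm, c.phi1, c.depth + 1});
    }
    return out;
}
```

With an 8 x 16 coarse grid and two refinement levels, `refine(8, 16, 2)` traces 768 rays in this toy setup, whereas sampling the whole sphere at the finest resolution would take 2048. A known limitation of the sketch: a coarse cell whose center misses is dropped even if part of it overlaps the target.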

m_sch · January 16, 2014, 10:52am

I have similar use cases. One is the propagation of electromagnetic waves at diffraction edges. If an incoming ray intersects such an edge, it generates e.g. 120 new rays, each with a different direction. Currently, I have to trace them sequentially.

blloyd · January 28, 2014, 4:48pm

So is there a performance concern with tracing rays sequentially? Or is there some other problem?

If it is better performance that you are looking for, keep in mind that dynamic parallelism has overhead. These use cases seem to have a relatively small number of rays. There probably isn’t enough work there to amortize the overhead.

Are the spawned rays coherent? For coherent rays there could be some potential benefit in having a construct in OptiX to spawn multiple rays. Such a construct would provide more information to the ray scheduler. This could be implemented efficiently without dynamic parallelism.

m_sch · January 29, 2014, 7:50am

Yes, it is a performance concern. What I “guess” happens (since the ray scheduling of OptiX is a black box to me) is:

1. A packet/warp/… of coherent rays is traced.
2. Only one of these rays hits a “spawning object”, which triggers the creation of multiple rays.
3. These rays are created and traced in this single ray’s thread, stalling the rest of the rays.
4. This might get even worse if one of the newly created rays again hits another “spawning object” …

These newly created rays are coherent to some extent, so handing them over to OptiX all together could help a lot.
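
[Editor's sketch] One way to picture "handing the rays over all together" is a wave-by-wave loop: a hit on a spawning object does not trace its secondary rays on the spot, it appends them to a queue, and the queued rays are processed as the next batch. The code below is a CPU-only model of that scheduling idea under loud assumptions. It is not how OptiX schedules rays (the thread itself calls that a black box) and it uses no OptiX API. `Ray`, `makePrimaryWave`, and `traceWavefront` are hypothetical names, and the rule that the first ray of every spawned group hits another spawning object is invented purely to exercise the nested case.

```cpp
#include <cstddef>
#include <vector>

struct Ray {
    int depth;         // 0 for primary rays
    bool hitsSpawner;  // HYPOTHETICAL: stands in for the result of a real trace
};

// Build a primary wave of n rays in which exactly one ray hits a spawning object.
std::vector<Ray> makePrimaryWave(std::size_t n) {
    std::vector<Ray> wave(n, Ray{0, false});
    if (!wave.empty()) wave[0].hitsSpawner = true;
    return wave;
}

// Process rays wave by wave. Spawned rays are deferred to the next wave
// instead of being traced inside the spawning ray's own loop iteration.
// Returns the number of rays in each wave.
std::vector<std::size_t> traceWavefront(std::vector<Ray> wave, int fanout, int maxDepth) {
    std::vector<std::size_t> waveSizes;
    while (!wave.empty()) {
        waveSizes.push_back(wave.size());
        std::vector<Ray> next;
        for (const Ray& r : wave) {
            if (r.hitsSpawner && r.depth < maxDepth) {
                // ASSUMPTION: child 0 hits another spawning object, the rest do not.
                for (int k = 0; k < fanout; ++k)
                    next.push_back({r.depth + 1, k == 0});
            }
        }
        wave.swap(next);  // the spawned rays become the next batch
    }
    return waveSizes;
}
```

For 32 primary rays, a fan-out of 6 (the anisotropic case mentioned earlier), and a depth limit of 2, `traceWavefront(makePrimaryWave(32), 6, 2)` yields waves of 32, 6, and 6 rays. The total ray count matches the recursive approach; what changes is that each batch of spawned rays is available as a group rather than being serialized inside one thread.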
