Thanks, I updated the bug report with your system configuration information.
I really don’t know if what you’re doing is a viable solution. That’s why I wanted to know why you’re doing it, so that I can ask the OptiX core developers internally.
If you want all native CUDA programming methods available in your ray tracing application, another approach would be to implement a wavefront ray casting renderer, where OptiX is only used for the ray generation and the ray-primitive intersection, and all shading calculations are done between optixLaunch calls with native CUDA kernels inside the same stream.
Since these kernel launches are asynchronous to the CPU, they would be processed in order on that CUDA stream as fast as possible.
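A minimal sketch of one such wavefront iteration could look like the following. The Ray/Hit structs, the shadeHits kernel, and the traceWave function are made-up illustration code, not from the SDK; only optixLaunch and the CUDA launch syntax are real API, and the pipeline, SBT, and device buffers are assumed to come from the usual OptiX/CUDA setup code:

```cpp
#include <cuda.h>
#include <cuda_runtime.h>
#include <optix.h>

// Hypothetical per-ray records; the actual layout is up to the application.
struct Ray { float3 origin; float tmin; float3 direction; float tmax; };
struct Hit { float t; unsigned int primID; float2 barycentrics; int valid; };

// All shading happens in plain CUDA, so every native feature is available.
__global__ void shadeHits(const Ray* rays, const Hit* hits,
                          float3* radiance, unsigned int numRays)
{
    const unsigned int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= numRays) return;
    // Placeholder shading: white on hit, black on miss.
    radiance[i] = hits[i].valid ? make_float3(1.0f, 1.0f, 1.0f)
                                : make_float3(0.0f, 0.0f, 0.0f);
}

// One wave: OptiX does only the ray-primitive intersection (the raygen
// program reads the Ray buffer and writes one Hit record per ray; the
// buffer pointers travel inside the launch parameter block at d_params),
// then a native CUDA kernel does the shading on the same stream.
void traceWave(OptixPipeline pipeline, const OptixShaderBindingTable& sbt,
               CUdeviceptr d_params, size_t paramsSize,
               const Ray* d_rays, const Hit* d_hits, float3* d_radiance,
               unsigned int numRays, CUstream stream)
{
    optixLaunch(pipeline, stream, d_params, paramsSize, &sbt, numRays, 1, 1);

    const unsigned int block = 256;
    const unsigned int grid  = (numRays + block - 1) / block;
    shadeHits<<<grid, block, 0, stream>>>(d_rays, d_hits, d_radiance, numRays);

    // No synchronization is needed in between: both launches are
    // asynchronous to the CPU and execute back to back in submission
    // order on the stream.
}
```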
The drawbacks of that approach are the memory bandwidth needed for the ray input and hit/miss output buffers, and the need to implement a processing pipeline where the work is chunked into GPU-saturating pieces.
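To sketch what that chunking could look like on top of the traceWave function above (totalRays, the buffer names, and the LaunchParams type are placeholders, and the chunk size is just an arbitrary tuning value):

```cpp
// Hypothetical chunked dispatch: process the full ray set in
// GPU-saturating pieces.
const unsigned int chunkSize = 1u << 20; // ~1M rays per wave; tune per GPU.

for (unsigned int base = 0; base < totalRays; base += chunkSize)
{
    const unsigned int count = (totalRays - base < chunkSize)
                             ? (totalRays - base) : chunkSize;
    // The launch parameter block would also need to be updated with this
    // chunk's buffer pointers (e.g. via cudaMemcpyAsync on the same
    // stream) before the optixLaunch inside traceWave.
    traceWave(pipeline, sbt, d_params, sizeof(LaunchParams),
              d_rays + base, d_hits + base, d_radiance + base,
              count, stream);
}
```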
There are multiple professional renderers which use this approach.
There is a very simple example of that approach inside the OptiX SDK: the optixRaycasting example. It is lacking any iteration and chunking of the work, though; it just shoots primary rays, does some shading with the normal vector on a model, and saves that as an image.
Related posts:
https://forums.developer.nvidia.com/t/branch-divergence/176258
https://forums.developer.nvidia.com/t/task-scheduling-in-optix-7/167050