3D Launch using Opitx to obtain 3D-complex data

droettger · March 11, 2022, 8:31am

Ok, you’re saying you shoot 512 x 512 x 512 x 64 x 64 = 2^39 = 549,755,813,888 rays per launch.

That’s about 550 GRays rays per launch, so even when assuming you have the highest-end RTX board, and let’s say that handles around 10 GRays/second in your case (which it won’t, even if the limit is actually higher), that would still take 55 seconds for one launch without doing anything else.

Now, what is your actual system configuration and how long does it really take?

Please always provide the following system configuration information when asking about OptiX issues:
OS version, installed GPU(s), VRAM amount, display driver version, OptiX (major.minor.micro) version, CUDA toolkit version (major.minor) used to generate the input PTX, host compiler version.

From the cropped code screenhots (note that the forum supports code blocks) it’s not apparent how your index_real and index_imag advance or if they are the same.

If the index_real and index_imag are constant inside the loop, it would be faster if you would not accumulate the result into the two output buffers inside that loop but accumulate the results into a local variable to keep them in registers and only write it once at the end.

Also note that it’s less efficient to write two individual floats to separate buffers because these would lie in different memory cache lines. The GPU microcode supports vectorized load and store instructions for 2- and 4-component data types, not 3-component which are handled as three individual scalars. Means it would be faster if you handled your complex numbers as float2 vectors and store them into one output buffer if possible.

The 3D launch will be scheduled as 2D slices.
See this discussion about the order and potential access hazards: https://forums.developer.nvidia.com/t/optixlaunch-configuration-revisited/198275

Topic		Replies	Views
rtContextLaunch3D OptiX	2	968	June 14, 2022
Launch dimensions in LaunchContextnD and optixLaunch OptiX	5	1686	October 12, 2021
3D OptixLaunch to accommodate multiple viewpoints OptiX	4	1152	October 12, 2021
rtContextLaunch1D with multiple GPUs OptiX	3	784	June 14, 2022
Launch_index and Launch_dim OptiX	5	2682	June 14, 2022
Launch size for best performances OptiX	11	1050	June 14, 2022
Issue with large 3D program launch size OptiX	3	442	June 14, 2022
Optix 6, moved for loop in ray gen to launch index. Getting launch error 9 OptiX	1	625	June 14, 2022
OptiX crashing when launching pipeline with big data OptiX	5	1018	June 14, 2022
launch time out with Optix Prime 3.7 beta 3 OptiX	5	1150	June 14, 2022

3D Launch using Opitx to obtain 3D-complex data

Related topics