CUDA/OptiX GPU Utilisation

OK, we’re going to do some testing on our Amazon P2 instance later and will try to get some information with one of the GPUs disabled, if possible.

Are there any examples of RT_BUFFER_GPU_LOCAL usage? And yeah, we don’t use any float3 buffers.
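For context, this is my current understanding of how it would look in the OptiX 6.x C++ wrapper, pieced together from the headers — the buffer names, formats, and dimensions here are placeholders, not our actual code:

```cpp
// Sketch, assuming the optixpp wrapper: a per-sample accumulation buffer that
// the host never reads back can be flagged GPU_LOCAL, so each device keeps its
// own copy instead of synchronising through pinned host memory.
optix::Buffer accum = context->createBuffer(
    RT_BUFFER_INPUT_OUTPUT | RT_BUFFER_GPU_LOCAL,  // per-device, no host sync
    RT_FORMAT_FLOAT4,                              // float4 (we avoid float3)
    width, height);
context["accum_buffer"]->set(accum);

// The final buffer we encode to a file stays a normal output buffer:
optix::Buffer output = context->createBuffer(
    RT_BUFFER_OUTPUT, RT_FORMAT_UNSIGNED_BYTE4, width, height);
context["output_buffer"]->set(output);
```

Is that roughly the intended pattern — accumulate in the GPU-local buffer across launches and only write the tone-mapped result to the regular output buffer?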

The display of results isn’t factored in; we just perform the tracing and encode the resulting buffer to a file after all samples have been taken.

Is there a way to explicitly set the stack size?
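From the 6.x API headers it looks like there is a context-level call — sketching what I think we’d try, with the byte count purely a placeholder to tune from:

```cpp
// Sketch, assuming the optixpp wrapper: shrink the per-thread stack and see
// what the default was. My understanding is that in RTX execution mode the
// stack size is managed from the trace depth instead, so we'd set that too.
context->setStackSize(2048);                    // bytes per thread (placeholder)
size_t previous = context->getStackSize();      // query the value in effect
context->setMaxTraceDepth(2);                   // primary ray + one bounce
```

Please correct me if `setStackSize` is ignored under RTX mode and `setMaxTraceDepth` is the only knob that matters there.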

We’re going to do some testing on an Amazon P3 instance, which boasts 4x Tesla V100 GPUs. Unfortunately, I think that’s the only other GPU generation available through Amazon EC2.

So thanks for all that info. We do have plans to switch to iterative tracing at some point, but what about the fact that we can only execute two blocks in parallel due to per-thread register usage? It seems that even if we make those optimisations, we’ll still be missing out on a lot of parallelism, if I understand correctly (which I might not). Or will the stack size directly affect that?