I am using OptiX and other APIs (like Vulkan, etc.) as a basic rasterizer.
I want to know where to find the theoretical maximum number of rays emitted per second for different graphics cards. Is it affected by the number of CUDA cores or by other hardware parameters?
I don’t think the marketing material contains that maximum rays/second number.
It mainly depends on the number and generation of the RT cores, the cache sizes, and the memory bandwidth.
Overall ray tracing performance inside applications then also depends on the generation, number, and clock speed of the streaming multiprocessors (CUDA cores), and on the memory accesses, which are again affected by the cache sizes and memory bandwidth.
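Since no vendor publishes a maximum rays/second figure, the most reliable number is one you measure in your own application: count the rays launched per frame and divide by the measured frame time (e.g. taken with CUDA event timers around the optixLaunch call). Below is a minimal sketch of that arithmetic; the launch size, per-pixel ray budget, and the 4 ms frame time are all hypothetical placeholders, not measured values:

```cpp
#include <cstdio>

int main()
{
    // Hypothetical example values; substitute your own launch dimensions,
    // per-pixel ray budget, and a frame time measured with cudaEvent timers.
    const double width        = 1920.0;  // launch width in pixels
    const double height       = 1080.0;  // launch height in pixels
    const double raysPerPixel = 3.0;     // e.g. 1 primary + up to 2 bounce rays
    const double frameTimeSec = 0.004;   // measured GPU time for one launch

    const double raysPerFrame = width * height * raysPerPixel;
    const double raysPerSec   = raysPerFrame / frameTimeSec;

    // Prints ~1.56 Grays/s for these placeholder numbers.
    printf("~%.2f Grays/s (upper bound; secondary rays may terminate early)\n",
           raysPerSec * 1e-9);
    return 0;
}
```

Note that this measures your whole pipeline (traversal, intersection, shading), not the isolated RT core throughput, which is exactly why the numbers differ so much between applications.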
The specifications or datasheets of individual GPUs, or the architecture whitepapers of GPU generations, contain TFLOPS numbers for the different core types, which can be compared.
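As a sanity check on how such a TFLOPS figure comes about for the shader cores: it is essentially core count times clock times two FLOPs per fused multiply-add. A small sketch using the published RTX 4090 specifications (16,384 CUDA cores at roughly a 2.52 GHz boost clock):

```cpp
#include <cstdio>

int main()
{
    // Published RTX 4090 figures; check the datasheet of your exact board.
    const double cudaCores    = 16384.0;
    const double boostClockHz = 2.52e9;  // ~2.52 GHz boost clock
    const double flopsPerFma  = 2.0;     // one fused multiply-add counts as 2 FLOPs

    // 16384 * 2.52e9 * 2 = ~82.6e12, matching the advertised ~82.6 FP32 TFLOPS.
    const double tflops = cudaCores * boostClockHz * flopsPerFma * 1e-12;
    printf("Peak FP32 throughput: ~%.1f TFLOPS\n", tflops);
    return 0;
}
```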
In general, the newer and more numerous the RT cores, the higher the memory bandwidth (the wider the memory bus), the bigger the caches, the more streaming multiprocessors, the higher the clocks, and the more VRAM, the better the board is for GPU ray tracing.
A very coarse rule of thumb is that each RTX GPU generation doubled the ray tracing performance.
That is not equally true for all ray tracing features; for example, motion blur on triangle data improved 5-fold in Ampere over Turing in actual applications (and even more in dedicated tests), and curve primitive intersection improved similarly.
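To illustrate what that coarse doubling rule implies across generations, here is a tiny sketch; the 2x-per-generation factor is the rule of thumb above, not a measured value:

```cpp
#include <cstdio>

int main()
{
    // Relative RT throughput under the coarse "2x per generation" rule,
    // normalized to Turing = 1x. Real gains vary per feature, as noted above.
    const char* generations[] = { "Turing", "Ampere", "Ada" };
    for (int i = 0; i < 3; ++i)
        printf("%-6s : ~%dx\n", generations[i], 1 << i);
    return 0;
}
```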
So if you're looking for the highest ray tracing performance, pick the Ada generation board with the biggest product number you can afford. Today the RTX 6000 Ada Generation is the highest-spec board, or the RTX 4090 if you can live with consumer-grade products.