How do RT cores work?

thiltuiv · November 27, 2024, 2:51pm

Hello! From the NVIDIA TURING GPU ARCHITECTURE white paper, I learned that RT cores consist of two specialized units, where the first unit performs the bounding box test and the second unit performs the ray-triangle intersection test. They save the SM from spending the thousands of instruction slots per ray, which is a computationally intensive process making it impossible to do on GPUs in real-time without hardware-based ray tracing acceleration. I wonder why these two units are able to quickly complete operations that are time-consuming on SM.
Any suggestions is appreciated. Please let me know if there is any relevant documentation explaining how RT cores work. Thanks in advance.

Topic		Replies	Views
RTX arrangement CUDA Programming and Performance	11	1107	January 23, 2020
RT Core Accelerated ray marching GPU - Hardware	1	565	January 19, 2024
Profile RT cores Raytracing	0	85	May 9, 2025
How to utilize CUDA, Tensor, and RT cores in one program CUDA Programming and Performance	5	3257	September 17, 2024
Just out of curiosty, can RT cores be able to perform arithmetic operations? Raytracing	1	77	December 22, 2025
Does BVH construction use RT core? OptiX	1	250	July 12, 2024
Any lower access to RT core than OptiX? Raytracing	5	1921	July 21, 2024
Use RTX directly (without Optix, DirectX, Vulkan)? GPU-Accelerated Libraries	0	652	January 8, 2021
Maximizing GPU Utilization Raytracing	2	1479	July 17, 2023
Opt out of RT hardware OptiX	4	935	June 14, 2022

How do RT cores work?

Related topics