Understanding Compute Capability

fatecasino · November 16, 2024, 10:16pm

I am building a simple OptiX-based application and I came across the Compute Capability solution configuration. I am using OptiX 8 and CUDA 12.6. My card is an RTX 3050.
My Compute Capability is 8.6.

My question is: how can I ensure that this application will run on other machines with higher or lower Compute Capabilities?
Should I set multiple Compute Capabilities with nvcc -gencode=arch=?
Thanks!

dhart · November 18, 2024, 6:17pm

Hi @fatecasino,

With OptiX, if you’re compiling to PTX or OptiX-IR, you can target the compute capability of the oldest GPU you need to support, and newer GPUs will work. For example, use 50 if you need Maxwell support, or 60 for Pascal and beyond. This is detailed in the “Program Input” section of the “Pipeline” chapter in the OptiX Programming Guide: https://raytracing-docs.nvidia.com/optix8/guide/index.html#program_pipeline_creation#program-input

Specifically:

The streaming multiprocessor (SM) target of the input OptiX program must be less than or equal to the SM version of the GPU for which the module is compiled.
To generate code for the minimum supported GPU (Maxwell), use architecture targets for SM 5.0, for example, --gpu-architecture=compute_50. Because OptiX rewrites the code internally, those targets will work on any newer GPU as well.
CUDA Toolkits 10.2 and newer throw deprecation warnings for SM 5.0 targets. These can be suppressed with the compiler option -Wno-deprecated-gpu-targets.
If support for Maxwell GPUs is not required, you can use the next higher GPU architecture target SM 6.0 (Pascal) to suppress these warnings.
Define the output type with --optix-ir or --ptx. Do not compile to obj or cubin.
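
As a rough sketch of how the points quoted above could be put into a build script (this is not from the OptiX guide): the Python helper below assembles an nvcc command line from the flags mentioned in the guide and checks the "target SM must be less than or equal to the GPU's SM" rule. The function names, file names, and include path are hypothetical placeholders, and applying the warning-suppression flag to every target below SM 6.0 is an assumption; the guide only mentions SM 5.0.

```python
import shlex


def nvcc_command(source, output, min_sm=50, optix_ir=True, include_dirs=()):
    """Build an nvcc command line for an OptiX device program (hypothetical helper).

    min_sm is the lowest SM version to support, e.g. 50 for Maxwell or 60 for Pascal.
    """
    cmd = ["nvcc", f"--gpu-architecture=compute_{min_sm}"]
    # The output must be OptiX-IR or PTX, never obj or cubin.
    cmd.append("--optix-ir" if optix_ir else "--ptx")
    if min_sm < 60:
        # Assumption: treat every target below SM 6.0 like the SM 5.0 case,
        # which triggers deprecation warnings in CUDA 10.2 and newer.
        cmd.append("-Wno-deprecated-gpu-targets")
    for directory in include_dirs:
        cmd += ["-I", directory]
    cmd += ["-o", output, source]
    return cmd


def module_can_load(target_sm, device_sm):
    """Rule from the guide: the input's SM target must be <= the GPU's SM version."""
    return target_sm <= device_sm


if __name__ == "__main__":
    # Placeholder paths; point these at your own sources and OptiX SDK.
    cmd = nvcc_command(
        "programs.cu",
        "programs.optixir",
        min_sm=60,
        include_dirs=["/path/to/optix/include"],
    )
    print(shlex.join(cmd))
    # An RTX 3050 reports compute capability 8.6, i.e. SM 86.
    print(module_can_load(60, 86))
```

With a single compute_60 target like this, one OptiX-IR or PTX file should cover Pascal and everything newer, including the 8.6 card from the question, so multiple -gencode entries should not be needed for the OptiX device code.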

Here are some additional general CUDA resources about compute capability, in case you have further questions.

Application Compatibility
PTX Compatibility
Your GPU Compute Capability
List of compute capability features (tables 20 & 21)
CUDA toolkit minimum compute capability by Robert Crovella on Stack Overflow

–
David.
