Generated PTX file twice bigger than Optix7 SDK sample

slazzo · September 4, 2019, 1:11pm

Building OptixPathTracer from the OptiX SDK samples generates an 18 KB PTX file, and the exe runs on a GTX 1060 at ~5 fps,
while building the same code from scratch generates a 39 KB PTX file that runs slower, at ~4 fps.

Windows 10.0.17763.0, CUDA 10.1, OptiX 7.0.0, VS Community 2019

I’m probably missing something but can’t figure out what. Here is the procedure I followed:

Created a new CUDA Project in VS2019
Hand-copied the project configuration from the sample in every category; only the include and lib paths differ, and they point to the same SDK
Created a new CUDA Source file, cloned the content from the OptixPathTracer.cu
Copy/Pasted the sample header and cpp files inside the new project
Changed the Project Configuration for CUDA C/C++ (-maxrregcount=0 --machine 64 -ptx -cudart static)
Superstition number, decided to have a coffee.
Project built successfully in Release mode, but the generated .ptx file has a lot more instructions than the one freshly built from the OptiX SDK samples.

After revisiting the project from the SDK, I see that it doesn’t have CUDA as a build dependency; instead it uses CMakeLists, which I’m not very familiar with but which is obviously the only difference between the two projects. This is also my first experience with both CUDA and OptiX, and there is probably a better way to create a new project using the two that I’m not aware of, but for now I’d be happy if I could at least build the optimized .ptx file.

The question is: which nvcc flags am I missing?

droettger · September 4, 2019, 2:01pm

If you diff-ed the *.ptx sources and found that the small code contained “approx” instructions for the trigonometric functions and square roots, and your code doesn’t, then you’re missing the --use_fast_math option.
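
As a rough way to make that comparison without reading a full diff, here is a minimal Python sketch that counts the instructions carrying the .approx modifier in a PTX listing. The function name count_approx_ops and the regular expression are illustrative assumptions, not part of any NVIDIA tool.

```python
import re
from collections import Counter

# Matches PTX opcodes that carry the .approx modifier,
# e.g. "sqrt.approx.ftz.f32" or "sin.approx.ftz.f32".
APPROX_OPCODE = re.compile(r"\b([a-z0-9]+)\.approx\b[.\w]*")


def count_approx_ops(ptx_text: str) -> Counter:
    """Count instructions using the .approx modifier, keyed by base opcode."""
    counts = Counter()
    for line in ptx_text.splitlines():
        code = line.split("//", 1)[0]  # ignore trailing PTX comments
        for match in APPROX_OPCODE.finditer(code):
            counts[match.group(1)] += 1
    return counts
```

Running it on the text of both .ptx files and comparing the two counters should show quickly whether one build uses approximate sqrt, sin, cos, or div instructions and the other does not.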

You also might not have used the same streaming multiprocessor (SM) target, and some targets generate additional spurious runtime functions (which aren’t used).
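
The SM target is recorded in the header of every PTX module as a .target directive, next to .version and .address_size, so two files can be compared directly. The following is a small sketch under that assumption; ptx_header and header_differences are hypothetical helper names.

```python
import re

# PTX module headers contain directives such as ".version 6.4" and ".target sm_60".
HEADER_DIRECTIVE = re.compile(r"^\s*\.(version|target|address_size)\s+(.+?)\s*$")


def ptx_header(ptx_text: str) -> dict:
    """Return the first .version, .target and .address_size directives found."""
    header = {}
    for line in ptx_text.splitlines():
        code = line.split("//", 1)[0]  # ignore PTX comments
        match = HEADER_DIRECTIVE.match(code)
        if match and match.group(1) not in header:
            header[match.group(1)] = match.group(2)
    return header


def header_differences(ptx_a: str, ptx_b: str) -> dict:
    """Map each header directive that differs to its (a, b) pair of values."""
    a, b = ptx_header(ptx_a), ptx_header(ptx_b)
    return {
        key: (a.get(key), b.get(key))
        for key in sorted(set(a) | set(b))
        if a.get(key) != b.get(key)
    }
```

If header_differences reports a different target for the two files, the -arch value passed to nvcc is the first option to align.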

Check your NVCC options for these things as well:
Assertion failed: "acp->isUsedAsSingleSemanticType()" - OptiX - NVIDIA Developer Forums

slazzo · September 4, 2019, 2:25pm

Indeed, this was the cause. In the property pages under CUDA C/C++, the fast math option is available only on the Host tab and was turned on; the Device tab has no such option. I just added --use_fast_math to the Command Line additional options, and now the .ptx is optimized. Thanks!
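
For anyone scripting the build outside Visual Studio, here is a hedged Python sketch that assembles an equivalent nvcc command line. It combines the -ptx and --machine 64 options from the first post with --use_fast_math. The function name nvcc_ptx_command, the include directory, and the optional arch value are assumptions for illustration and are not taken from the SDK's CMake files.

```python
def nvcc_ptx_command(source, output, include_dirs=(), arch=None):
    """Assemble an nvcc argument list that compiles one .cu file to PTX."""
    cmd = ["nvcc", "-ptx", "--machine", "64", "--use_fast_math"]
    if arch:
        # e.g. "sm_60"; should match the target used by the build being compared
        cmd += ["-arch", arch]
    for directory in include_dirs:
        cmd.append(f"-I{directory}")  # include paths (hypothetical locations)
    cmd += ["-o", output, source]
    return cmd
```

The returned list can be passed to subprocess.run(cmd, check=True) on a machine where nvcc is on the PATH.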
