OptiX 5 setStackSize appears to multiply by 5

I’m using OptiX 5 with CUDA 9.1 in Visual Studio 2015, running Windows 10 with a Quadro K1100M and driver 391.03.
If I call setStackSize, then getStackSize, the returned stack size is 5 times the argument to setStackSize (or sometimes a bit more, presumably due to alignment rules). This did not occur with my previous configuration (OptiX 3.9.1, CUDA 7.5, VS2013). Is this a bug, or am I misunderstanding the new usage somehow?
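For reference, a minimal repro of what I'm describing (assuming the OptiX 5 C++ wrapper header `optixu/optixpp_namespace.h`; requires the OptiX SDK and a supported GPU to actually run):

```cpp
#include <optixu/optixpp_namespace.h>
#include <iostream>

int main()
{
    optix::Context context = optix::Context::create();

    context->setStackSize(1024); // request 1024 bytes per thread

    // On OptiX 5 this reports roughly 5x the requested value
    // (e.g. ~5120, possibly rounded up for alignment).
    std::cout << "stack size: " << context->getStackSize() << std::endl;

    context->destroy();
    return 0;
}
```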
Thanks

Yes, that’s an unfortunate change from OptiX 3 to 4.
See explanations here: https://devtalk.nvidia.com/default/topic/1010533

Thanks for the information. Is the 5x multiplier consistent for OptiX 5.0 executables across supported GPU architectures?
While troubleshooting intermittent out-of-memory issues, I’ve been logging getAvailableDeviceMemory() results at various points in my code, and I’m seeing some odd results. If nvidia-smi reports ~1GB of free device memory, getAvailableDeviceMemory after context creation typically reports ~1.7GB free device memory. A particular test case sometimes runs with ~900MB free, and sometimes fails with an out-of-memory error. Sometimes it reports ~100MB free after the first OptiX launch, then ~900MB after subsequent launches. I’ve even seen some runs where OptiX reports 0 bytes free device memory, then my application allocates a few more buffers and runs without issues. I know that other processes impact the device memory available, but during these tests I’m not doing anything that should generate 900MB of variability.
How is getAvailableDeviceMemory different from nvidia-smi’s device memory usage? Is it trying to account for device memory which could become available by swapping device-resident buffers to system memory? Any ideas why I’m seeing such large variation in memory availability, and/or how I could better manage it (besides using less memory in general or dropping support for smaller GPUs)?
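For what it’s worth, the logging I described is essentially this (a sketch assuming the OptiX 5 C++ wrapper; `logFreeMemory` is just a hypothetical helper name, and device ordinal 0 is assumed):

```cpp
#include <optixu/optixpp_namespace.h>
#include <iostream>

// Hypothetical helper: log the free device memory on device 0
// at a labeled point in the program.
static void logFreeMemory(optix::Context context, const char* label)
{
    const RTsize freeBytes = context->getAvailableDeviceMemory(0);
    std::cout << label << ": "
              << freeBytes / (1024ull * 1024ull) << " MB free" << std::endl;
}

// Usage, e.g.:
//   logFreeMemory(context, "after context creation");
//   context->launch(0, width, height);
//   logFreeMemory(context, "after first launch");
```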
Thanks

“Is the 5x multiplier consistent for OptiX 5.0 executables across supported GPU architectures?”

Yes, which makes it all the more important to determine the minimum required stack size up front, because GPUs with more cores will need more memory.

I cannot tell what would cause the differences in available memory in your case. That is a state which is constantly in flux, partly due to the underlying OS. For OptiX in particular, the acceleration structure build itself temporarily consumes quite a lot of memory, so depending on when you read the free memory amount, it can vary drastically.

I would expect that nvidia-smi and the rtDeviceGetAttribute() function in OptiX use the same underlying CUDA interface to read the device information.

I sometimes use a small nvidia-smi command to dump device information in a command prompt while running applications. It looks like this and prints the information for each installed device every 500 ms.

"C:\Program Files\NVIDIA Corporation\NVSMI\nvidia-smi.exe" --format=csv,noheader --query-gpu=timestamp,name,pstate,memory.total,memory.used,utilization.memory,utilization.gpu --loop-ms=500

You could also use the rtContextSetUsageReportCallback() function introduced in OptiX 5.0.0 to dump human-readable information about what OptiX did internally.
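A sketch of hooking that up (assuming the OptiX 5 C++ wrapper; the callback parameters follow the RTusagereportcallback typedef of verbosity level, tag, message, and user data):

```cpp
#include <optixu/optixpp_namespace.h>
#include <cstdio>

// Callback invoked by OptiX with internal usage information.
static void usageReportCallback(int lvl, const char* tag, const char* msg, void* /*cbdata*/)
{
    std::printf("[%d][%s] %s", lvl, tag, msg);
}

int main()
{
    optix::Context context = optix::Context::create();

    // Verbosity levels 1-3 produce increasingly detailed reports; 0 disables them.
    context->setUsageReportCallback(usageReportCallback, 3, nullptr);

    // ... set up the scene and launch; OptiX emits reports during these calls.

    context->destroy();
    return 0;
}
```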