NVIDIA Shader Compiler Consumes too much memory

DrChat · May 26, 2016, 10:54pm

Hey, so I’m working on a shader compiler that takes shader microcode from another language and converts it to SPIR-V.

I’ve noticed that when dealing with code that does a lot of branching (jump statements and such), NVIDIA’s shader compiler will eat up a lot of memory.
I’ve even had plenty of cases where it would consume so much that it would deadlock my entire computer, prompting a restart.

From the compiled shader code, I could see that NVIDIA’s compiler was taking the dumb route and duplicating a lot of code when taking branches. This can be fixed by including OpSelectionMerges, but unfortunately, there’s no metadata in the source shader code that describes high-level control flow constructs.
We could add a compiler that analyzes the code and adds in OpSelectionMerges, but my question is: Is NVIDIA going to address this behavior?

Mathias_Schott · May 27, 2016, 2:20pm

Could you provide the SPIR-V shader in question so we can take a look?

Regards,

Mathias Schott

DrChat · May 27, 2016, 5:49pm

Disasm: // /* 0.0 */ exec // /* 22 */ add r6.x__w, r5.xyyy, c242 - Pastebin.com

At the top is the source microcode, and then the SPIR-V disassembly follows.
For the jmp instructions, we generate an OpBranchConditional without any OpSelectionMerges, which seems to be problematic for the compiler.

Binary: https://dl.dropboxusercontent.com/u/5619434/Development/Xenia/shader_vk_4ED842B1EDA370D7.bin.frag

Mathias_Schott · May 27, 2016, 6:19pm

Thanks. We’ll take a look.

Mathias

Mathias_Schott · May 30, 2016, 2:26pm

Some intermediate observations from our shader compiler team:

Our SPIR-V consumer requires "structured control flow" which was intended for all code originating from GLSL. We could probably do a work-around of sorts but ideally you'd extend your code generator since that would also reduce the risk of running into similar issues on other SPIR-V consumers. Speaking of which, have you tried running this on other Vulkan implementations?
spirv-disam (SPIRV-Tools v2016.0-dev spirv-1.1-rev1-26-g0d512bb)tool fails with
```
%2573 = OpTypeVector %512 1024
   error: 319: Invalid opcode: 11008
```
Minor: .spv seems to be the canonical file extension for files containing raw SPIR-V content
Minor: The generator number is set to -1. Getting a “generator number” from the SPIR-V registry might be interesting at some point

Mathias

DrChat · May 30, 2016, 4:20pm

Yeah - I figured :P. Like I said though - this code doesn’t originate from GLSL but instead originates from microcode intended to run on another graphics chip entirely. And no - I haven’t been able to try this on other implementations due to the lack of alternate hardware.

It’s a bit odd though that the compiler is consuming enough memory to hang my computer. I’ve got 8GB of ram and I am currently idling at 5GB used. Xenia uses around 1GB during emulation putting me at around 6-7GB used - so if the compiler gets a wonky shader it’ll use up the rest of my free ram and hang my entire computer.

Hmm - our copy of spirv-disasm appears to be working OK (source of the pastebin disasm).

Thanks Mathias and the compiler team

Mathias_Schott · May 31, 2016, 10:01am

You call it odd, the spec calls it undefined behavior, so all bets are off ;)

https://www.khronos.org/registry/spir-v/specs/1.1/SPIRV.html#_validation_rules_for_shader_a_href_capability_capabilities_a

CFG:
Loops must be structured, having an OpLoopMerge instruction in their header.
Selections must be structured, having an OpSelectionMerge instruction in their header.

On a more serious note, a SPIR-V for Vulkan validation layer should catch this.

SPIR-V does “open the door” to non-structured control flow. Our compiler doesn’t allow anything but structured flow. So if this is really shader code it is required to be in CFG form.

Btw GitHub - KhronosGroup/SPIRV-Cross: SPIRV-Cross is a practical tool and library for performing reflection on SPIR-V and disassembling SPIR-V back to high level languages. also uses 10GB+ of memory when working on this .spv file…

Topic		Replies	Views
Stack Overflow in SPIR-V Compiler Vulkan	0	1030	August 12, 2016
NVCC Compling question, where is the lmem? CUDA Programming and Performance	5	1481	March 4, 2011
NVRM: Xid (0084:00) kernel does not terminate CUDA Programming and Performance	16	8754	April 10, 2008
NVCC Compling question, where is the lmem? CUDA Programming and Performance	2	608	March 2, 2011
Weird use of registers Too many registers are wasted CUDA Programming and Performance	8	5490	July 4, 2007
BUG: Broken register allocation, toolkit 2.3 CUDA Programming and Performance	15	6918	May 10, 2010
Possible nvcc compiler error CUDA Programming and Performance	0	1367	September 8, 2009
shared memory usage by nvcc CUDA Programming and Performance	0	2503	September 14, 2008
Why does vkCreateComputePipelines take so long for a recursive-like computer shader? Vulkan	5	1894	January 4, 2017
How to force variables to be on a register, local memory or shared memory? CUDA Programming and Performance	6	7282	March 21, 2008

NVIDIA Shader Compiler Consumes too much memory

Related topics