I have a kernel that uses 10 registers, but when I compile it with --maxrregcount=16, it uses 9 registers.
Is this normal? I’d rather keep it at 10 registers: my understanding is that the optimizations triggered by maxrregcount often add a little overhead (and I’m not talking about register spilling).
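
For anyone who wants to reproduce the comparison, here is a minimal sketch of one way to read the register count at runtime. `myKernel` and its body are placeholders rather than my real kernel, so the numbers it reports will differ from the 10 and 9 above.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Placeholder kernel (an assumption for illustration, not the kernel from the question).
__global__ void myKernel(float *out, const float *in, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        out[i] = 2.0f * in[i] + 1.0f;
}

int main()
{
    // Ask the runtime for the resources the compiled kernel actually uses.
    cudaFuncAttributes attr;
    cudaError_t err = cudaFuncGetAttributes(&attr, myKernel);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "cudaFuncGetAttributes failed: %s\n",
                     cudaGetErrorString(err));
        return 1;
    }

    // numRegs is the per-thread register count; a non-zero localSizeBytes
    // can indicate spilling (or other per-thread local memory use).
    std::printf("registers per thread: %d\n", attr.numRegs);
    std::printf("local memory per thread: %zu bytes\n", attr.localSizeBytes);
    return 0;
}
```

Building this once with `nvcc -Xptxas -v kernel.cu` and once with `nvcc -Xptxas -v --maxrregcount=16 kernel.cu` (the file name is arbitrary) should show the same per-kernel figures at compile time, including any spill stores and loads, so the two builds can be compared directly.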