Is it possible to limit the number of SMs that a program uses? Assume the device has 15 SMs and I want to use only 8 of them. Is there any option in nvcc or the NVIDIA commands for this?
No. There are no options in any NVIDIA tools, nor anything provided by the CUDA runtime, to easily enable this.
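Since nothing built in does this, a workaround that is sometimes used is to do it by hand inside the kernel: each block reads the PTX special register `%smid` and exits immediately if it landed on an SM you don't want, while the remaining blocks pull work from a global counter. Below is a rough sketch of that idea, not an official mechanism. The names `limited_sm_kernel`, `current_sm_id`, `sm_limit` and the doubling "work" are made up for illustration, and it assumes SM IDs are numbered from 0 and that at least one launched block ends up on an allowed SM, neither of which the hardware guarantees.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Read the ID of the SM this thread is currently running on
// (PTX special register %smid).
__device__ unsigned int current_sm_id() {
    unsigned int id;
    asm volatile("mov.u32 %0, %%smid;" : "=r"(id));
    return id;
}

// Hypothetical kernel: blocks scheduled on an SM with id >= sm_limit exit
// right away; the other blocks fetch work items from a shared counter.
// Each work item is a chunk of blockDim.x elements of data.
__global__ void limited_sm_kernel(unsigned int sm_limit, int num_items,
                                  int *next_item, float *data) {
    // All threads of a block run on the same SM, so the whole block exits.
    if (current_sm_id() >= sm_limit) return;

    __shared__ int item;
    while (true) {
        if (threadIdx.x == 0) item = atomicAdd(next_item, 1);
        __syncthreads();
        int my_item = item;
        __syncthreads();  // everyone has read item before it is overwritten
        if (my_item >= num_items) break;
        int idx = my_item * blockDim.x + threadIdx.x;
        data[idx] *= 2.0f;  // placeholder for the real work
    }
}

int main() {
    const unsigned int sm_limit = 8;  // assumption: use SMs 0..7 only
    const int threads = 256, num_items = 1024;
    const int n = threads * num_items;

    int device = 0, num_sms = 0;
    cudaGetDevice(&device);
    cudaDeviceGetAttribute(&num_sms, cudaDevAttrMultiProcessorCount, device);

    float *data = nullptr;
    int *next_item = nullptr;
    cudaMallocManaged(&data, n * sizeof(float));
    cudaMallocManaged(&next_item, sizeof(int));
    for (int i = 0; i < n; ++i) data[i] = 1.0f;
    *next_item = 0;

    // Launch more blocks than SMs so that every SM is likely to receive some.
    limited_sm_kernel<<<4 * num_sms, threads>>>(sm_limit, num_items,
                                                next_item, data);
    cudaError_t err = cudaDeviceSynchronize();
    if (err != cudaSuccess) {
        printf("CUDA error: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Count how many elements were actually processed.
    int done = 0;
    for (int i = 0; i < n; ++i) done += (data[i] == 2.0f);
    printf("processed %d of %d elements\n", done, n);

    cudaFree(data);
    cudaFree(next_item);
    return 0;
}
```

Note that this only keeps your own work off the other SMs. The rejected blocks are still scheduled there briefly before they exit, and the kernel has to be written around a work queue instead of the usual `blockIdx`-based indexing, so it is hardly an "easy" option.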