I am trying to measure the performance and contention of workloads when they run on different numbers of SMs. Is it possible to disable some of the SMs on an RTX 3090? I am running different applications (neural networks, graphics) and would like to measure their performance with limited parallelism.

CUDA doesn’t provide a way to do this. You can try a hacky workaround, such as launching a special kernel that keeps some SMs occupied while your workload runs, but this won’t work for graphics.
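
For what it's worth, here is a rough sketch of what such an "occupier" kernel could look like. It is an illustration under several assumptions, not a supported mechanism: the names `occupy`, `current_sm`, and `n_hold` are made up for this example, and it assumes a device of compute capability 7.0 or newer (for `__nanosleep`), compiled with something like `nvcc -arch=sm_86`. Each block asks for the maximum thread count and the maximum opt-in shared memory so that, hopefully, only one block fits per SM, then waits until the host sets a flag. The block scheduler gives no guarantee about placement, so the kernel records `%smid` per block to let you check afterwards which SMs were actually held.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

#define CHECK(call)                                                   \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,        \
                    cudaGetErrorString(err_));                        \
            exit(1);                                                  \
        }                                                             \
    } while (0)

// Read the hardware ID of the SM this thread is running on (PTX %smid).
__device__ unsigned int current_sm() {
    unsigned int id;
    asm volatile("mov.u32 %0, %%smid;" : "=r"(id));
    return id;
}

// Hypothetical occupier: thread 0 polls a host-visible flag, the other
// threads wait at the barrier, so the whole block stays resident.
__global__ void occupy(volatile int *stop, unsigned int *sm_of_block) {
    if (threadIdx.x == 0) {
        sm_of_block[blockIdx.x] = current_sm();
        while (*stop == 0) __nanosleep(1000);  // back off between polls
    }
    __syncthreads();
}

int main(int argc, char **argv) {
    int n_hold = (argc > 1) ? atoi(argv[1]) : 8;  // number of SMs to try to hold

    cudaDeviceProp prop;
    CHECK(cudaGetDeviceProperties(&prop, 0));
    if (n_hold < 1 || n_hold >= prop.multiProcessorCount) {
        fprintf(stderr, "n_hold must be in [1, %d)\n", prop.multiProcessorCount);
        return 1;
    }

    // Pinned, mapped host memory: visible to the running kernel and the host.
    int *h_stop, *d_stop;
    unsigned int *h_sm, *d_sm;
    CHECK(cudaHostAlloc((void **)&h_stop, sizeof(int), cudaHostAllocMapped));
    CHECK(cudaHostAlloc((void **)&h_sm, n_hold * sizeof(unsigned int),
                        cudaHostAllocMapped));
    *h_stop = 0;
    CHECK(cudaHostGetDevicePointer((void **)&d_stop, h_stop, 0));
    CHECK(cudaHostGetDevicePointer((void **)&d_sm, h_sm, 0));

    // Request as much dynamic shared memory as one block may have, so a
    // second occupier block is unlikely to fit on the same SM.
    size_t smem = prop.sharedMemPerBlockOptin;
    CHECK(cudaFuncSetAttribute(occupy,
                               cudaFuncAttributeMaxDynamicSharedMemorySize,
                               (int)smem));

    cudaStream_t hold, work;
    CHECK(cudaStreamCreateWithFlags(&hold, cudaStreamNonBlocking));
    CHECK(cudaStreamCreateWithFlags(&work, cudaStreamNonBlocking));

    occupy<<<n_hold, prop.maxThreadsPerBlock, smem, hold>>>(d_stop, d_sm);
    CHECK(cudaGetLastError());

    // ... launch and time the workload under test in stream `work` here ...
    CHECK(cudaStreamSynchronize(work));

    *h_stop = 1;  // release the occupier blocks
    CHECK(cudaStreamSynchronize(hold));

    for (int b = 0; b < n_hold; ++b)
        printf("occupier block %d ran on SM %u\n", b, h_sm[b]);

    CHECK(cudaStreamDestroy(hold));
    CHECK(cudaStreamDestroy(work));
    CHECK(cudaFreeHost(h_stop));
    CHECK(cudaFreeHost(h_sm));
    return 0;
}
```

A few caveats if you try this. The workload has to be launched from the same process in a different stream; kernels from separate processes are normally time-sliced rather than run side by side, so an occupier in another process would not take SMs away in the intended sense. Anything that synchronizes the whole device (for example `cudaDeviceSynchronize`) will block until the flag is set, so only synchronize on the workload's own stream. A held SM may also still accept small blocks that need no shared memory, since a 1024-thread block does not necessarily use up every thread slot, so treat the result as an approximation and check the printed SM IDs for duplicates.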