I am trying to run a program on a Fermi card, configured at runtime with 48 KB of shared memory and 16 KB of L1 cache. (I plan to use the CUDA runtime function that sets the shared memory/L1 cache split.)
I wrote the program anticipating 48 KB of shared memory; its shared memory footprint is more than 16 KB. However, nvcc complains that I am using excessive memory. How do I tell the compiler that I plan to use 48 KB of shared memory instead of 16 KB?
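
For illustration, the situation can be sketched roughly as follows. This is not the actual program: the kernel name `bigSharedKernel`, the constant `N_FLOATS`, and the 40 KB array size are placeholders chosen for the example. The runtime call shown, `cudaFuncSetCacheConfig` with `cudaFuncCachePreferShared`, is assumed to be the function referred to above for requesting the 48 KB shared / 16 KB L1 split.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Placeholder size: 10240 floats * 4 bytes = 40 KB of static shared memory,
// which is more than 16 KB but fits within a 48 KB configuration.
#define N_FLOATS 10240

// Hypothetical kernel standing in for the real one.
__global__ void bigSharedKernel(float *out)
{
    __shared__ float buf[N_FLOATS];

    // Each thread fills a strided slice of the shared array.
    for (int i = threadIdx.x; i < N_FLOATS; i += blockDim.x)
        buf[i] = (float)i;
    __syncthreads();

    // One thread writes a value back so the array is actually used.
    if (threadIdx.x == 0)
        out[blockIdx.x] = buf[N_FLOATS - 1];
}

int main()
{
    float *d_out = NULL;
    float h_out = 0.0f;

    cudaMalloc(&d_out, sizeof(float));

    // Ask the runtime to prefer the larger shared memory split for this kernel.
    cudaError_t err = cudaFuncSetCacheConfig(bigSharedKernel,
                                             cudaFuncCachePreferShared);
    if (err != cudaSuccess)
        printf("cudaFuncSetCacheConfig: %s\n", cudaGetErrorString(err));

    bigSharedKernel<<<1, 256>>>(d_out);
    err = cudaDeviceSynchronize();
    if (err != cudaSuccess)
        printf("kernel: %s\n", cudaGetErrorString(err));

    cudaMemcpy(&h_out, d_out, sizeof(float), cudaMemcpyDeviceToHost);
    printf("last element: %f\n", h_out);

    cudaFree(d_out);
    return 0;
}
```

The cache preference here is set at runtime, on the host side, while the complaint about memory usage comes from nvcc at compile time, before that call ever executes.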