about maximum amount of shared memory per multiprocessor

hello,
as far as i know, the maximum amount of shared memory per multiprocessor at the latest hardware is 48KB,
in my case, my cuda program is limited by the maximum amount of shared memory per multiprocessor,
because i calculate many small-scale fittings, that use iteration method, so i must use shared memory.
I want to know there is any plan to increase shared memory at next generation nvidia graphics cards?

thank you!