I’m looking at the changes in the 0.3.1 patch and wanted to confirm I see the important bits (I hope I didn’t miss a changelog in the docs somewhere.)
-
I’m sad to see sm86 is still not supported, is this correct?
-
It looks like most of the changes are in include/database, to definitions and kernel ptx and the addition of several look up tables.
- are these mainly performance related?
- how significant are they?
-
I see in the examples a call to set the MaxDynamicSharedMemorySize is added. (Glad to see this as i’ve been doing the same for quite a while, see below). Even though in your case the amount of memory is known at compile time (FFT::shared_memory_size) the call cudaFuncSetAttribute() is a runtime function right? Do you know if the compiler makes any different decisions?
Thanks!
Ben