Is there any API support these functions
- whether can I explicitly bind a certain thread block to a certain SM?
2)if 1 can, whether can I keep the result of block(kernel) on shared memory and let next block(kernel) access it?
my GPU is G295
Thanks,