Shared memory is lifetime of block?

mimichw · May 14, 2007, 11:40am

main()
{
Part1<<< grid1, threads1 >>>( d_X , d_B , d_r , p_sum , d_module);
Part2<<< grid1, threads1 >>>( d_X , d_B , d_r , p_sum , d_module);
Part3<<< grid1, threads1 >>>( d_X , d_B , d_r , p_sum , d_module);
Part4<<< grid1, threads1 >>>( d_X , d_B , d_r , p_sum , d_module);
}

In .cu , within the main() , i configued kernel part1 , part2 , part3 and part4 to
execute on device . In part1, 2 ,3 and 4 , i have declared its shared memory
(ex : 4KB per block).

when i looking at .cubin

name = Part1
lmem = 0
smem = 1XXX0 <== example
reg = 8

name = Part2
lmem = 0
smem = 1XXX1 <== example
reg = 8

name = Part3
lmem = 0
smem = 1XXX2 <== example
reg = 8

…

Why shared memory are increasing when i configued more kernels.

prkipfer · May 14, 2007, 12:29pm

There is currently a known bug with the shared mem calculation if you put more than one kernel in the .cu file. The toolkit update to come shortly should fix this.

Peter

mimichw · May 15, 2007, 2:26am

there no method to solve this problem now?

If i am using only one kernel (ex:part1) , this can concurrently process more than two blocks in a multiprocessor . But , when i am using more kernels , shared memory are also increasing to more. How many kernel’s blocks can concurently process on multiprocessor are limited by shared memory.

prkipfer · May 15, 2007, 3:28pm

Put them into separate .cu files.

Peter

mimichw · May 16, 2007, 5:22am

I have tried to configue a kernel in another .cu, and using a function in

main .cu to call this kernel. But , i still must “include” this .cu into main .cu.

How to separate this problem?

prkipfer · May 16, 2007, 11:03am

You need a C (host) wrapper for each kernel invocation. Then you can simply call the kernels through the wrapper from a C function by including a forward declaration to the wrapper.

Peter

mimichw · May 17, 2007, 6:33am

it’s ok now. very thank you :)

Topic		Replies	Views
CUDA: Using shared memory between different kernels.. CUDA Programming and Performance	4	16298	July 21, 2017
Shared memory per block Related to shared memory of an MCPU CUDA Programming and Performance	3	3990	August 14, 2007
Shared Memory Is my program correct ? CUDA Programming and Performance	2	6827	March 23, 2009
Kernel Execution issues related to Shared Memory CUDA Programming and Performance	5	5164	November 9, 2009
Max shared memory CUDA Programming and Performance	2	1491	December 3, 2008
Shared memory issues Initialization of shared memory CUDA Programming and Performance	2	6725	August 23, 2007
shared memory and CUDA calculator CUDA Programming and Performance	6	4044	October 26, 2008
shared memory CUDA Programming and Performance	4	3268	April 24, 2007
Not enough shared mem CUDA Programming and Performance	5	5778	November 3, 2009
why does performance scale with allocated shared memory size? CUDA Programming and Performance	1	651	May 3, 2013

Shared memory is lifetime of block?

Related topics