Using the value attribute in a module and automatic arrays

crip_crop1 · January 10, 2011, 3:59pm

Hi there,

I’m trying to port a rather large code to GPU but I’m running into some difficulties.

(1) Is it allowed to use the value attribute in module scope? By doing this, does the compiler automatically know that the variable is a device variable, as the compiler throws up errors if I put both the device and the value attributes?

(2) 5 of the arrays I’m trying to copy over to the device are automatic arrays (their size is determined by an adjustable variable, dependent on the data inputted in other parts of the program), which is not allowed.

Would it be possible to get around this by allocating the array size, or is this still classed as being automatic? As I’m trying to save memory on the device, I’d rather use a more efficient method than just declaring huge arrays.

Any suggestions or comments would be much appreciated.

Cheers,
Crip_crop

MatColgrove · January 10, 2011, 11:32pm

Hi Crip_crop,

(1) Is it allowed to use the value attribute in module scope? By doing this, does the compiler automatically know that the variable is a device variable, as the compiler throws up errors if I put both the device and the value attributes?

“value” indicates that a scalar argument is to be passed by value into the subroutine and used to initialize a local device scalar. “device”, when applied to an argument, means that you’re passing in by reference a pointer to a scalar in global device memory that is shared by all threads. Hence the two are not compatible.
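To illustrate the difference, here is a minimal sketch rather than anything from the original code. The module, kernel, and variable names (scale_mod, scale, x_d, total_d) are made up for the example. The scalars n and a are host values copied into each thread, while total lives in global device memory and is shared by all threads.

```fortran
module scale_mod
  use cudafor
  implicit none
contains
  ! Hypothetical kernel: n and a are passed by value, total by reference.
  attributes(global) subroutine scale(x, n, a, total)
    integer, value :: n          ! host scalar, copied to every thread
    real, value :: a             ! host scalar, copied to every thread
    real, device :: x(n)         ! array in global device memory
    real, device :: total        ! one scalar in device memory, shared by all threads
    integer :: i

    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i <= n) x(i) = a * x(i)
    ! Only one thread writes the shared device scalar.
    if (i == 1) total = a * real(n)
  end subroutine scale
end module scale_mod

program main
  use cudafor
  use scale_mod
  implicit none
  integer, parameter :: n = 1024
  real :: x(n), total
  real, device :: x_d(n), total_d

  x = 1.0
  x_d = x                        ! host-to-device copy
  ! n and 2.0 are ordinary host values; total_d must be a device variable.
  call scale<<<(n + 255) / 256, 256>>>(x_d, n, 2.0, total_d)
  x = x_d                        ! device-to-host copy (waits for the kernel)
  total = total_d
  print *, x(1), total
end program main
```

Declaring a dummy argument with both attributes would ask for two different passing mechanisms at once, which is why the compiler rejects it.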

Would it be possible to get around this by allocating the array size, or is this still classed as being automatic?

No. Currently NVIDIA devices do not allow memory allocation from device kernels. You must use fixed-size arrays in your kernels or allocate device data from the host.

Maybe you can rework your algorithms so that your threads can share the arrays and hence make them allocatable device arrays allocated from the host before you launch your kernel.
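As a rough sketch of that approach, assuming a size n that is only known at run time: the host allocates an allocatable device array and passes it to the kernel, so nothing inside the kernel is automatic. The names fill_mod, fill, and work_d are hypothetical and not taken from the original code.

```fortran
module fill_mod
  use cudafor
  implicit none
contains
  ! Hypothetical kernel: work is sized by the caller, not inside the kernel.
  attributes(global) subroutine fill(work, n)
    integer, value :: n
    real :: work(n)              ! dummy arrays in a kernel reside on the device
    integer :: i

    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i <= n) work(i) = real(i)
  end subroutine fill
end module fill_mod

program main
  use cudafor
  use fill_mod
  implicit none
  integer :: n
  real, allocatable :: work(:)
  real, device, allocatable :: work_d(:)

  n = 1000                       ! stand-in for a size computed elsewhere in the program
  allocate(work(n), work_d(n))   ! work_d is allocated on the device, from the host
  call fill<<<(n + 127) / 128, 128>>>(work_d, n)
  work = work_d                  ! device-to-host copy
  print *, work(1), work(n)
  deallocate(work, work_d)
end program main
```

Because the array is allocated with exactly n elements, no device memory is wasted on an oversized fixed array.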

Alternatively, you may be able to use assumed-size shared arrays, where the size in bytes is the third argument of your kernel launch configuration, though you are limited by the size of shared memory.
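A small sketch of the shared-memory variant follows, assuming a single thread block and 4-byte default reals. The names rev_mod, reverse_block, and tile are invented for the example. The assumed-size shared array gets its size from the third launch parameter, which is given in bytes.

```fortran
module rev_mod
  use cudafor
  implicit none
contains
  ! Hypothetical kernel: reverses x in place using one thread block.
  attributes(global) subroutine reverse_block(x, n)
    integer, value :: n
    real :: x(n)
    real, shared :: tile(*)      ! size comes from the launch configuration
    integer :: t, tr

    t = threadIdx%x
    tr = n - t + 1
    tile(t) = x(t)
    call syncthreads()           ! all of tile must be written before it is read
    x(t) = tile(tr)
  end subroutine reverse_block
end module rev_mod

program main
  use cudafor
  use rev_mod
  implicit none
  integer :: n, i
  real, allocatable :: x(:)
  real, device, allocatable :: x_d(:)

  n = 64                         ! run-time size; must fit in one thread block here
  allocate(x(n), x_d(n))
  x = [(real(i), i = 1, n)]
  x_d = x
  ! Third launch parameter: bytes of dynamic shared memory (assumes 4-byte reals).
  call reverse_block<<<1, n, n * 4>>>(x_d, n)
  x = x_d
  print *, x(1), x(n)
  deallocate(x, x_d)
end program main
```

This only works while n elements fit in the shared memory available to one block, so it suits small per-block work arrays rather than large global ones.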

Hope this helps,
Mat

crip_crop1 · January 11, 2011, 11:24am

That’s really helpful, thanks.

Are there any performance implications from using the value attribute? Or is it in fact quicker because the scalar variable is stored in local memory? Also, is there a limit to the number of scalars passed by value?

Cheers,
Crip_crop

crip_crop1 · January 11, 2011, 12:44pm

Another issue with “value”…

I seem to be having a problem with giving variables the “value” attribute in module scope. Is this allowed?

Cheers,
Crip_crop

MatColgrove · January 11, 2011, 6:25pm

Hi Crip Crop,

I seem to be having a problem with giving variables the “value” attribute in module scope. Is this allowed?

The “value” attribute is only allowed on scalar dummy arguments.
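For module-scope data that should live on the GPU, the “device” attribute is the one to use. A minimal sketch, with made-up names (params_mod, scale_d, data_d, apply), might look like this; “value” appears only on the kernel's scalar dummy argument.

```fortran
module params_mod
  use cudafor
  implicit none
  ! Module-scope variables take "device", not "value".
  real, device :: scale_d
  real, device, allocatable :: data_d(:)
contains
  ! Hypothetical kernel: uses the module's device data directly.
  attributes(global) subroutine apply(n)
    integer, value :: n          ! "value" is only valid on scalar dummy arguments
    integer :: i

    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i <= n) data_d(i) = scale_d * data_d(i)
  end subroutine apply
end module params_mod

program main
  use cudafor
  use params_mod
  implicit none
  integer :: n
  real, allocatable :: h(:)

  n = 500                        ! stand-in for a size computed at run time
  allocate(h(n), data_d(n))
  h = 1.0
  data_d = h                     ! host-to-device copy
  scale_d = 3.0                  ! assignment from the host sets the device scalar
  call apply<<<(n + 127) / 128, 128>>>(n)
  h = data_d                     ! device-to-host copy
  print *, h(1), h(n)
  deallocate(h, data_d)
end program main
```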

Are there any performance implications from using the value attribute?

By default, Fortran passes arguments by reference (i.e. an address in memory). The Fortran 2003 value attribute can be used to override this default by passing the argument’s value and initializing a local variable to this value.

In CUDA Fortran, the ‘value’ attribute allows you to use host scalar variables as arguments. Without ‘value’, you would be passing in an address in host memory or need to create a variable in device memory to store the value before passing it in.

As for performance, in general it’s better to use local kernel variables rather than global variables, since they are more likely to be stored in a register. However, the number of registers available is limited, so it’s best not to use too many local variables, or else they’ll ‘spill’ to global memory.

Also, is there a limit to the number of scalars passed by value?

In CUDA Fortran, there is not a limit on the number of variables that can be passed to a kernel.

Hope this helps,
Mat
