Meaning of target compiler option cuda2.3/ cuda3.0?

Hello,
what do the target compiler options cuda2.3 and cuda3.0 do?
I tried to find it out but couldn’t find anything except: “Use CUDA 2.3/3.0 Toolkit compatibility”…

I changed my Fortran PGI Accelerator compilation from
pgf90 -g -ta=nvidia,cc11 -Minfo -fastsse -c ./jacobi.F90 -o jacobi.o
to
pgf90 -g -ta=nvidia,cc11,3.0 -Minfo -fastsse -c ./jacobi.F90 -o jacobi.o
and suddenly my performance increased from 3300 MFlops to 4600 MFlops!

So, is cuda2.3 the default option? And what happens internally when switching to 3.0?

Bye, Sandra

Hi Sandra,

So, is cuda2.3 the default option?

Yes. The default is to use the CUDA 2.3 toolkit.

And what happens internally when switching to 3.0?

The PGI compiler will use the NVIDIA CUDA 3.0 toolkit. CUDA 3.0 was just released and is new in PGI 10.4. For Fermi, you’ll need to use CUDA 3.0.

Note that you will need to have a NVIDIA driver that supports CUDA 3.0. If you are unsure which version you have, the PGI utility “pgaccelinfo” will show your version number.

Hope this helps,
Mat

% pgaccelinfo
CUDA Driver Version            3000   

Device Number:                 0
Device Name:                   Tesla T10 Processor
Device Revision Number:        1.3
Global Memory Size:            4294770688
Number of Multiprocessors:     30
Number of Cores:               240
Concurrent Copy and Execution: Yes
Total Constant Memory:         65536
Total Shared Memory per Block: 16384
Registers per Block:           16384
Warp Size:                     32
Maximum Threads per Block:     512
Maximum Block Dimensions:      512, 512, 64
Maximum Grid Dimensions:       65535 x 65535 x 1
Maximum Memory Pitch:          2147483647B
Texture Alignment              256B
Clock Rate:                    1296 MHz
Initialization time:           4424750 microseconds
Current free memory            4256304896
Upload time (4MB)               998 microseconds ( 796 ms pinned)
Download time                  1194 microseconds ( 769 ms pinned)
Upload bandwidth               4202 MB/sec (5269 MB/sec pinned)
Download bandwidth             3512 MB/sec (5454 MB/sec pinned)

Thanks for your reply!