I’m running a trial version of 12.4 on windows XP x86 and I’m trying the example contained on page 12 of this document http://www.pgroup.com/doc/openACC_gs.pdf.
I compile as per instructions on page 13 like this:
pgcc -V -acc test.c
but I get this:
pgcc-Error-Unknown switch: -acc
pgcc 12.4-0 32-bit target on x86 Windows -tp penryn
Copyright 1989-2000, The Portland Group, Inc. All Rights Reserved.
Copyright 2000-2012, STMicroelectronics, Inc. All Rights Reserved.
compiling with: pgcc -test.c works fine
If I try
pgcc -V -ta=nvidia:cc11 test.c
I get this:
pgcc 12.4-0 32-bit target on x86 Windows -tp penryn
Copyright 1989-2000, The Portland Group, Inc. All Rights Reserved.
Copyright 2000-2012, STMicroelectronics, Inc. All Rights Reserved.
NOTE: your trial license will expire in 13 days, 14.2 hours.
PGC/x86 Windows 12.4-0
Copyright 1989-2000, The Portland Group, Inc. All Rights Reserved.
Copyright 2000-2012, STMicroelectronics, Inc. All Rights Reserved.
PGC-S-0037-Syntax error: Recovery attempted by deleting keyword kernel (test.c:4)
PGC-S-0035-Syntax error: Recovery attempted by replacing identifier copyin by keyword cache (test.c: 4)
PGC-S-0035-Syntax error: Recovery attempted by replacing identifier copyout by keyword cache (test.c: 4)
PGC-S-0036-Syntax error: Recovery attempted by inserting before keyword for (test.c: 5)
PGC-S-0155-Clause not allowed in #pragma - cache (test.c: 5)
PGC/x86 Windows 12.4-0: compilation completed with severe errors
Any ideas?
TIA
output from pgaccelinfo in case it’s of any use:
CUDA Driver Version: 4020
Device Number: 0
Device Name: Quadro FX 770M
Device Revision Number: 1.1
Global Memory Size: 536543232
Number of Multiprocessors: 4
Number of Cores: 32
Concurrent Copy and Execution: Yes
Total Constant Memory: 65536
Total Shared Memory per Block: 16384
Registers per Block: 8192
Warp Size: 32
Maximum Threads per Block: 512
Maximum Block Dimensions: 512, 512, 64
Maximum Grid Dimensions: 65535 x 65535 x 1
Maximum Memory Pitch: 2147483647B
Texture Alignment: 256B
Clock Rate: 1250 MHz
Execution Timeout: Yes
Integrated Device: No
Can Map Host Memory: Yes
Compute Mode: default
Concurrent Kernels: No
ECC Enabled: No
Memory Clock Rate: 800 MHz
Memory Bus Width: 128 bits
Max Threads Per SMP: 768
Async Engines: 1
Unified Addressing: No
Current free memory: 472702976
Upload time (4MB): 1880 microseconds ( 0 ms pinned)
Download time: 3280 microseconds ( 0 ms pinned)
Upload bandwidth: 2231 MB/sec (2438 MB/sec pinned)
Download bandwidth: 1278 MB/sec (1792 MB/sec pinned)