Can pgcc report the number of registers per thread when compiling OpenACC programs for NVIDIA GPUs?
In some older forum posts, I saw that the output from PGI_ACC_NOTIFY included this info. For example:
CC 1.0 : 9 registers; 64 shared, 0 constant, 0 local memory bytes
CC 2.0 : 14 registers; 0 shared, 80 constant, 0 local memory bytes
However, my version of pgcc (v14.9) doesn’t print this information.