Hello Mat, I am profiling my C++ program with pgprof. At the end of the run I get this message:
The start and end timestamps on 310 kernels, memcpys and other collected profile data are invalid. Those profiling records have been dropped and will not be displayed in the timeline.
When I run the kernel memory analysis, I get this message:
Insufficient kernel memory data: the data needed to perform memory bandwidth analysis for the kernel could not be collected.
The same happens with the kernel performance analysis, so I get no results from either one.
My environment:
pgc++ 18.10-0 64-bit target on x86-64 Linux -tp haswell
PGI Compilers and Tools
Copyright © 2018, NVIDIA CORPORATION. All rights reserved
NVIDIA-SMI 410.48, Driver Version: 410.48
CUDA 10.0, x86 Linux
I also have a smaller question: how do I use shared memory in OpenACC? Is there a link or an example I could refer to? I have several pieces of data that are reused repeatedly, and I would like to speed up the program by keeping them in shared memory.