I have an application with some CUDA kernels that runs without problems. However, I am not able to profile it with pgprof, which returns the following message:
0: ALLOCATE: 0 bytes requested; status = 0(no error)
What could be the reason for this message? I checked all the variables that are allocated on the device, and they are all allocated correctly.