I’m reopening this topic. I tried running the program with pgcollect. It generated a few files, one of which is pgprof.out.
Then I start pgprof and open a new profiling session with these settings:
Profile: path to pgprof.out
Executable: path to executable program
Source: path to program sources
I then get an error saying that pgprof.out and/or the executable are invalid. What could be wrong?
Additionally, the program runs considerably slower when I run it through pgcollect.
For me, though, it would be better to get access to the terminal and see the kernel timing info produced by the -ta=nvidia,time option. The code I’m working with has a few variables for the logical I/O unit numbers. They are defined as parameters in a separate file (which I have access to). The variable for writing to the terminal is called ITTY (it equals 6). Using
write(itty,*) 'Something…' or write(*,*) 'Something…' prints text on the terminal, but the timing info does not appear anywhere. Where is the accelerator timing info printed by default (stdout or stderr)?
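To make the setup concrete, here is a minimal sketch of the kind of output statements I mean. The program name tty_demo and the message strings are made up for illustration, and ITTY is declared inline here rather than in the separate parameter file. The third write goes to error_unit from iso_fortran_env only so that the two streams can be told apart. It says nothing about where the compiler runtime itself sends the timing info.

```fortran
program tty_demo
  ! error_unit is the standard-error unit from the Fortran 2003 intrinsic module
  use, intrinsic :: iso_fortran_env, only: error_unit
  implicit none

  ! Assumption: mirrors the ITTY parameter from the separate file (value 6)
  integer, parameter :: itty = 6

  write(itty,*) 'Something via itty'        ! explicit unit 6
  write(*,*) 'Something via default unit'   ! default output unit
  write(error_unit,*) 'Something on stderr' ! standard error
end program tty_demo
```

Running the built program with stdout and stderr redirected to separate files (for example `> out.log 2> err.log` in a POSIX shell) would show which stream each message lands on, and the same redirection could be used to check where the timing output ends up.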