I installed PGI from the CD under /hptc_cluster/pgi, so MPICH (mpich1) is under /hptc_cluster/pgi/linux86-64/7.2/mpi/mpich/.
But when I tried to debug an MPI application with:
mpirun -machinefile machinefile -np 4 -dbg=pgdbg ./turb
it always stopped at mpi_init, and the command panel said:
pgdbg>
Reloading:
/skovira/home/ccsad_1/zxiao_jhu_s_pgi_mpich1_debug/turb -p4pg /skovira/home/ccsad_1/zxiao_jhu_s_pgi_mpich1_debug/PI13701 -p4wd /skovira/home/ccsad_1/zxiao_jhu_s_pgi_mpich1_debug -mpichtv
ERROR: cannot read value at address 0xF2E0F018.
ERROR: cannot read value at address 0xF2E0F000.
Signalled SIGSEGV at 0x407364, function main
0x407364: 48 89 5C 24 8 movq %rbx,8(%rsp)
ERROR: cannot read value at address 0xF2E0F018.
ERROR: cannot read value at address 0xF2E0F000.
When I tried to open the message queue window, it said:
ERROR: unable to load /hptc_cluster/pgi/linux86-64/7.2/mpi/mpich/lib/libtvmpich.so: libpgc.so: cannot open shared object file: No such file or directory
Aborting message queue dump.
In fact, the application does run with 4 processes when started without the debugger:
/hptc_cluster/pgi/linux86-64/7.2/mpi/mpich/bin/mpirun -machinefile machinefile -np 4 ./turb
What is the problem? Thanks very much.
-sad