pgf77 performance issue 6.0 vs 5.2

akushner · December 9, 2005, 3:03pm

I have some legacy f77 code that I’ve been testing the 5.2 versus 6.0 compiler. The system I am running on is a Intel Pentium 4 running Suse 9.3, Linux 2.6.11. Doing a

pgf77 -V

I get 5.2-4 and 6.0-8 respectively. The CPU times I get for two different programs on 5.2 are;
238.951 and 221.501

For the 6.0 run for the same programs I get;
268.297 and 245.096

For both cases the compiler options concerning optimization are;
-fast -fastsse -Miniline

I have tested other programs and have gotten consistent results with 6.0 producing slower code than 5.2. This also appears to be true on our 64-bit Opteron systems as well. However, I would be very happy to move to 6.0 if I can resolve this problem. That level corrects execution-time problems in other programs that are compiled with 5.2 on our Linux 2.4 systems.

MatColgrove · December 9, 2005, 9:45pm

Hi akushner,

Can you post the 5.2 and 6.0 runtimes for the following flagsets:

-fastsse
-fastsse -Mipa=fast,inline

I want to see if the regression is caused by inlining or by some other optimization. Also, I want to see what happens if you use IPA inlining instead.

I suspect that a routine that was being inlined is not longer. To view what subroutines are being inlined add “-Minfo=inline” to the compilation line and compare the output between the 5.2 and 6.0.

Note that “-fast” is part of “-fastsse” so is not needed.

Mat

akushner · December 10, 2005, 12:49pm

Matt,

Thanks for the reply. I’ll run the tests and post the info when I get back to the office on Monday.

We have never used -Mipa because we get the message from the link phase (I can’t recall the exact message) that it was turned off because of not having a main or something. The entry to the programs is through a C front end, so I thought that caused it to be turned off (we have to use -Mnomain). If we could get Mipa to work that would be great.

Also, thanks for the note about -fast and -fastsse. I thought I saw there were some flags turned on by -fast that were not turned on -fastsse, but I may have misread the manual.

MatColgrove · December 11, 2005, 5:04am

IPA’s most likely complaining that it’s missing some IPA information. If you compile the C portion of the code with IPA as well, the message should go away. Also, you can try “-Mipa=fast,inline,safe”. “safe” tells pgipa that you think it’s safe to go ahead with the IPA recompilation even if your missing some information.

Mat

akushner · December 12, 2005, 4:53pm

Unfortunately we do not have PGI’s C compiler licensed. So, the pertinent output for both the 5.2 and 6.0 compiles with the Mipa and Minfo flags as you suggested looked like;

1, extracting subprogram for IPA, size 35
1, extracting subprogram for IPA, size 28
1, extracting subprogram for IPA, size 52
1, extracting subprogram for IPA, size 22
IPA inhibited: no main routine

So, I don’t think Mipa is a factor. The runtime table for the 4 runs is;
5.2 -fastsse -Mipa=fast,inline,safe; 239.535u 1.054s 4:12.18 95.4% 0+0k 0+0io 3pf+0w
5.2 -fastsse; 240.561u 1.128s 4:16.59 94.1% 0+0k 0+0io 3pf+0w
6.0 -fastsse -Mipa=fast,inline,safe; 270.837u 0.981s 4:45.92 95.0% 0+0k 0+0io 2pf+0w
6.0 -fastsse ; 270.509u 0.620s 4:46.15 94.7% 0+0k 0+0io 0pf+0w
While several other applications I’ve tested have shown that 5.2 object code is faster than the equivalent 6.0 code, I did test a different application this morning that has the 6.0 code being 10% faster than the 5.2 code.

Andy

MatColgrove · December 12, 2005, 6:12pm

Hi Andy,

It’s not inlining, so the next step is to start breaking out the individual components of “-fastsse” to determine which optimization is causing the slow-down. The most likely culprits are “-Mlre”, “-Msmart”, “-Mvect=sse”.

Try running with:

“-fastsse -Mnosmart”, “-fastsse -Mnolre”, “-fastsse -Mnovect”.

If those aren’t it, start at “-O2” and then progressively add in the following optimizations: “-O2 -Munroll=c:1 -Mnoframe -Mlre -Msmart -Mvect=sse -Mscalarsse -Mcache_align -Mflushz”.

Also, can you send us your code (to trs@pgroup.com) or is it available on the web? We should release 6.1 this week and I’d like to see if the regression still occurs. If it does, then I’ll file a technical problem report (TPR) to have the regression fixed.

Thanks,
Mat

Topic		Replies	Views
Flags for AMD64 Legacy PGI Compilers (archived)	3	7145	October 24, 2005
longer execution time in PGCC 6.0.5 than PGCC 5.2.4 Legacy PGI Compilers (archived)	3	6535	July 27, 2005
IPA not effective Legacy PGI Compilers (archived)	5	6504	January 5, 2007
Different answers with "-fast" and "-fastsse& Legacy PGI Compilers (archived)	1	12048	July 22, 2004
-Mipa Vs. -g Legacy PGI Compilers (archived)	3	7462	July 1, 2005
SEGV and -fast optimization (f90) Legacy PGI Compilers (archived)	2	3675	December 8, 2009
-Mipa and libraries Legacy PGI Compilers (archived)	4	4940	February 2, 2009
PGI 6.0 to 6.1: Mnoscalarsse and optimization Legacy PGI Compilers (archived)	1	4195	May 1, 2006
building 465.tonto from CPU2006 Legacy PGI Compilers (archived)	4	6352	April 6, 2009
adding -Mipa=fast produce pgf90-Fatal-/opt/pgi/linux86-64/9 Legacy PGI Compilers (archived)	3	3673	October 29, 2009

pgf77 performance issue 6.0 vs 5.2

Related topics