You’ll need to do some analysis to determine the cause. The PGI profiler, PGPROF, is MPI-aware, so I would suggest starting by profiling your code to see where the differences are. See: http://www.pgroup.com/doc/pgprofug.pdf
Do you have two i7-6700K chips in your system? If not, you are oversubscribing your cores by running on hyperthreads, which can lead to slowdowns. Your chip has only 4 physical cores, with two hyperthreads per core. Hyperthreads help overall throughput but not the speed of an individual thread, because the two threads on a core compete for that core's resources.
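If you want to double-check how many physical cores the OS actually sees, here is a rough Python sketch. It is my own illustration, not a PGI tool. It assumes an x86 Linux system where /proc/cpuinfo lists "physical id" and "core id" for each logical processor, and the function name `count_cores` is made up for this example.

```python
import os


def count_cores(cpuinfo_text):
    """Return (logical, physical) core counts from /proc/cpuinfo-style text.

    Assumes the x86 Linux layout: one blank-line-separated block per logical
    processor, each with 'physical id' and 'core id' fields. On systems that
    omit those fields the physical count will not be meaningful.
    """
    logical = 0
    cores = set()
    for block in cpuinfo_text.strip().split("\n\n"):
        fields = {}
        for line in block.splitlines():
            key, sep, value = line.partition(":")
            if sep:
                fields[key.strip()] = value.strip()
        if "processor" in fields:
            logical += 1
            # Hyperthreads on the same core share (physical id, core id).
            cores.add((fields.get("physical id"), fields.get("core id")))
    return logical, len(cores)


if __name__ == "__main__":
    path = "/proc/cpuinfo"
    if os.path.exists(path):
        with open(path) as f:
            logical, physical = count_cores(f.read())
        print("logical processors:", logical)
        print("physical cores:", physical)
    else:
        print(path, "not found; this sketch only works on Linux")
```

If the number of MPI ranks you launch is larger than the physical core count reported here, some ranks are sharing a core through hyperthreading.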