I installed the HPC SDK version 23.11 today and am also seeing much slower compilation and slower writes to standard output with nvc++ than with g++ and clang++.
g++
$ time g++ -o llil4hmap -std=c++20 -fopenmp -Wall -O3 llil4hmap.cc -I./parallel-hashmap
real 0m3.574s
user 0m3.441s
sys 0m0.124s
$ ./llil4hmap /data1/input/big* | cksum
llil4hmap (fixed string length=12) start
use OpenMP
use boost sort
get properties 3.759 secs
hmap to vector 1.043 secs
vector stable sort 1.715 secs
write stdout 0.868 secs
total time 7.386 secs
count lines 323398400
count unique 200483043
701308064 1804347429
clang++
$ time clang++ -o llil4hmap -std=c++20 -fopenmp -Wall -O3 llil4hmap.cc -I./parallel-hashmap
real 0m2.963s
user 0m2.887s
sys 0m0.068s
$ ./llil4hmap /data1/input/big* | cksum
llil4hmap (fixed string length=12) start
use OpenMP
use boost sort
get properties 3.759 secs
hmap to vector 0.710 secs
vector stable sort 1.125 secs
write stdout 0.702 secs
total time 6.298 secs
count lines 323398400
count unique 200483043
701308064 1804347429
nvc++
$ time nvc++ -o llil4hmap -std=c++20 -fopenmp -Wall -O3 llil4hmap.cc -I./parallel-hashmap
real 0m21.274s
user 0m20.828s
sys 0m0.413s
$ ./llil4hmap /data1/input/big* | cksum
llil4hmap (fixed string length=12) start
use OpenMP
use boost sort
get properties 3.804 secs
hmap to vector 0.697 secs
vector stable sort 1.104 secs
write stdout 5.071 secs
total time 10.678 secs
count lines 323398400
count unique 200483043
701308064 1804347429
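
To check whether the "write stdout" gap comes from the output path itself rather than from the rest of the program, the write phase can be timed on its own. The following is only a minimal sketch, not code from llil4hmap.cc: the helper name time_write and the line format in the usage note are hypothetical, chosen for illustration.

```cpp
#include <chrono>
#include <cstddef>
#include <cstdio>
#include <string>

// Hypothetical helper (not taken from llil4hmap.cc): writes `count` copies
// of `line` to `out`, flushes the stream, and returns the elapsed
// wall-clock time in seconds.
double time_write(std::FILE* out, const std::string& line, std::size_t count) {
    const auto start = std::chrono::steady_clock::now();
    for (std::size_t i = 0; i < count; ++i) {
        std::fwrite(line.data(), 1, line.size(), out);
    }
    std::fflush(out);  // include the final flush in the measured time
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double>(stop - start).count();
}
```

Calling something like time_write(stdout, "abcdefghijkl\t1\n", n) from a small main, printing the result to stderr, building it with each of the three compilers using the same flags as above, and piping the output through cksum would show whether the difference reproduces without the hash map and sort phases. The line contents and the value of n are assumptions and should be adjusted to resemble the real output.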