Great job with the new nvhpc images; they clean up a ton of things I’ve had to do by hand before in images with CUDA-aware OpenMPI, etc.
The runtime images have a huge number of layers, caused by a ton of discrete copies: docker history on [nvcr.io/nvidia/nvhpc:20.9-runtime-cuda11.0-ubuntu20.04](http://nvcr.io/nvidia/nvhpc:20.9-runtime-cuda11.0-ubuntu20.04) shows 100 layers.
This makes the images difficult to build on top of, since the max number of layers in Docker with overlay2 is ~125.
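For anyone who wants to check how much headroom an image leaves under that limit, here is a rough Python sketch. It assumes the ~125 figure quoted above and reads the `RootFS.Layers` list from the JSON printed by `docker image inspect`. The helper names (`count_layers`, `layer_headroom`, `inspect_image`) are made up for illustration. Note that this counts filesystem layers only, so the number can come out lower than the row count from docker history, which also lists metadata-only steps.

```python
import json
import subprocess

# Approximate layer limit quoted in this post; an assumption, not an exact constant.
MAX_LAYERS = 125


def count_layers(inspect_output: str) -> int:
    """Count filesystem layers in the JSON printed by `docker image inspect`."""
    images = json.loads(inspect_output)  # the command prints a JSON array
    return len(images[0]["RootFS"]["Layers"])


def layer_headroom(inspect_output: str, max_layers: int = MAX_LAYERS) -> int:
    """Return how many more layers fit on top before reaching max_layers."""
    return max_layers - count_layers(inspect_output)


def inspect_image(image: str) -> str:
    """Run `docker image inspect` for a locally available image and return raw JSON."""
    result = subprocess.run(
        ["docker", "image", "inspect", image],
        check=True,
        capture_output=True,
        text=True,
    )
    return result.stdout
```

With the image pulled locally, `layer_headroom(inspect_image("nvcr.io/nvidia/nvhpc:20.9-runtime-cuda11.0-ubuntu20.04"))` would report how many layers a derived Dockerfile could still add.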
It would be helpful if the runtime images were squashed to a single layer. I can squash them myself before putting things on top, but that feels like an unnecessary extra step.