TX2 compression/decompression

Tegra TX2 block digagram shows a compression/decompression unit inside the Pascal GPU block. Is it used for compressing data in deep learning inference ? Is there a CUDA API that triggers these units for a buffer/texture or is it transparent to the developer ?