Example code for Outer Vector Scaling with FP8 data types

I would like to experiment with Outer Vector Scaling for FP8 data types using the cuBLASLt scaling mode CUBLASLT_MATMUL_MATRIX_SCALE_OUTER_VEC_32F.

However, I haven’t been able to find any example code for this in the NVIDIA/CUDALibrarySamples repository on GitHub.

Could someone kindly share an example of how to use Outer Vector Scaling with FP8?