Why is TN format required for FP8 in cublasLtMatmul()?

From cublasLtMatmul():

To use FP8 kernels, the following set of requirements must be satisfied:
A must be transposed and B non-transposed (The “TN” format).

Why is this? Usually matmuls ask for “NT” format.

Thanks.