Hi,
I would like to know if tensor core is utilized in cuDSS. Since cuBLAS uses tensor core for dense matrix operations and cuDSS is based on cuBLAS, I assume cuDSS also uses tensor core.
I tried to profile both float and double matrix solves. However, I did not see tensor core usage in sm80_xmma_gemm_f32/sm90_xmma_gemm_f64 functions. Does this mean cuDSS does not support tensor core at the current stage?
Thank you.
Di