How does CuDNN-Backend support Batch Gemm Array?

If I would like to call BatchGemm(like semantics in cublasGemmBatchedEx) fusion kernels, how could I achieve using Backend API?

I cannot find examples on batch gemm array in latest cudnn-frontend, and no description in CuDNN document, could you shed some light on this usage?

Thanks
Gino

Hi,
The Batch Gemm operation is described in our API reference doc here,

A sample matmul example is here at

One can modify the batch size by changing the example in accordance to the rules specified in the API.

Thank you.

Hello, I noticed the example already have ability to achieve the same semantics as bias API “cublasGemmStridedBatchedEx”, but my point is how could Backend API support same semantics of bias API “cublasGemmBatchedEx”. The main issue is how could user configure the pointer array for A & B?

Thanks
Gino