Does cublaslt batch mode for Pointer Arrays apply for scaling factors as well?

Anoop_Prabha · January 2, 2026, 6:40am

Pointer Array batch mode unlike the traditionally supported strided batch mode, allows individual matrices in non-contiguous memory locations to be part of batch by passing an array of device pointers. But what happens for the scale A pointer and other scaling factor pointers for other matrices. Does each matrix in the batch have a unique scaling factor and hence even this becomes a pointer array or does a single scaling factor is shared across the batch. Please advise.

Topic		Replies	Views
New cuBlas function "getrfBatched" Legacy PGI Compilers	5	8664	January 21, 2014
CUDA Pro Tip: How to Call Batched cuBLAS routines from CUDA Fortran Technical Blog	4	443	April 27, 2015
How can I call cublasSgemmBatched on pointer device arrays without allocating them twice? GPU-Accelerated Libraries	3	1618	August 2, 2018
Way to covert pointer (d_A) to array (d_Array []) GPU-Accelerated Libraries	4	659	April 3, 2019
Confusion with cublas pointer mode CUDA Programming and Performance	0	914	May 15, 2015
Try calling cublasgemmBatch in OpenACC Legacy PGI Compilers	2	711	June 26, 2023
CUBLAS Batched Matrix Multiplies? CUDA Programming and Performance	4	6826	March 12, 2012
Solve one dense linear system in one thread block CUDA Programming and Performance	4	771	January 20, 2017
CUBLAS operating on different parts of an array CUBLAS based code development CUDA Programming and Performance	2	8132	August 8, 2010
Batch Matrix Multiplication using CuBLAS GPU-Accelerated Libraries tensorrt , cuda , kernel , c-plus-plus	17	4041	March 2, 2021

Does cublaslt batch mode for Pointer Arrays apply for scaling factors as well?

Related topics