Computing the log of a matmul from the log of two matrices

Hello everyone,

I am looking for a way to compute the elementwise log of a matrix product from the elementwise logs of the two factors: I want $\log(AB)$ given $\log(A)$ and $\log(B)$, with the log taken elementwise.

My initial goal is to implement this in Triton, but I'm open to any solution in CUDA or even pseudocode. Do you have any suggestions for how I could modify the code in the Triton tutorial without losing too much efficiency?

https://triton-lang.org/main/getting-started/tutorials/03-matrix-multiplication.html#sphx-glr-getting-started-tutorials-03-matrix-multiplication-py

As far as I know, this is not generally possible for arbitrary matrices $A$ and $B$: $\log(AB) = \log(A) + \log(B)$ holds for the matrix logarithm only when $AB = BA$. Do your matrices always commute?

Thank you for your answer. Sorry, I forgot to mention that I'm talking about the elementwise log of the matrices. Basically, the matrices A and B contain very small probabilities (sometimes $\exp(-6000)$), so I can't store them in standard float32 without underflowing. Now I need to compute the elementwise log of the matrix product, but I'm not sure how to do that efficiently.

My current solution consists of computing the row-wise max of $\log(A)$ and the column-wise max of $\log(B)$, exponentiating $\log(A) - \text{rowmax}$ and $\log(B) - \text{colmax}$, doing the matmul, taking the elementwise log of the result, and adding $\text{rowmax}_i + \text{colmax}_j$ back to entry $(i, j)$.
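
In PyTorch-like code, it looks roughly like this (the name `log_matmul` and the shapes are just for illustration); the plan would be to fuse the same steps into the Triton tutorial kernel:

```python
import torch

def log_matmul(logA: torch.Tensor, logB: torch.Tensor) -> torch.Tensor:
    """log(A @ B) computed stably from logA (M, K) and logB (K, N).

    Subtracting the per-row / per-column maxima keeps every argument of
    exp() at or below 0, so the exponentials cannot overflow and the
    largest summand is exactly 1.
    """
    row_max = logA.amax(dim=1, keepdim=True)   # (M, 1)
    col_max = logB.amax(dim=0, keepdim=True)   # (1, N)
    P = torch.exp(logA - row_max) @ torch.exp(logB - col_max)
    return torch.log(P) + row_max + col_max    # offsets broadcast to (M, N)
```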

I was wondering whether there is a more efficient way to do this. Right now the exponentiation is by far the bottleneck: I must be saturating the special function units, but I don't know how to do any better.

I’m working with an RTX 4090.

One element of a matrix multiplication is a sum of products, and you want to compute the log of this sum. Let's assume those summands are all positive. The sum then lies between its largest term and (number of terms) times its largest term, so the resulting log lies between $\log\big(\max_k A_{ik} B_{kj}\big)$ and $\log\big(K \cdot \max_k A_{ik} \cdot \max_k B_{kj}\big)$, where $K$ is the number of sum terms.

Perhaps a better approach is to roughly normalize the A and B matrices multiplicatively (this only adds a constant offset to the result after the log).

Then use a polynomial approximation for the final log: do the actual, exact matrix multiplication, form several elementwise powers of the result, and combine them with the polynomial coefficients.

In the end this is probably more accurate than going through log A and log B, and it would not need the special function unit.
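
A minimal sketch of such a polynomial log (my notation, under the assumption that you first split off the binary exponent with `frexp`, so the polynomial only ever sees mantissas in $[0.5, 1)$; the degree-6 truncation is an arbitrary accuracy choice):

```python
import math
import torch

def poly_log(x: torch.Tensor) -> torch.Tensor:
    """Elementwise natural log without the special function unit.

    Writes x = m * 2**e with m in [0.5, 1), so that
    log(x) = e*log(2) + log(1 + t) with t = m - 1 in [-0.5, 0).
    """
    m, e = torch.frexp(x)
    t = m - 1.0
    # truncated series for log(1 + t) in Horner form; a minimax polynomial
    # of the same degree would be more accurate for the same cost
    p = t * (1 + t * (-1/2 + t * (1/3 + t * (-1/4 + t * (1/5 - t/6)))))
    return e.to(x.dtype) * math.log(2.0) + p
```

With this truncation you only get a few decimal digits of accuracy, but everything is plain multiply-add.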

Not sure whether the normalization helps enough with your small non-representable values.

Or you keep the $\log A$ and $\log B$ representation, but instead of a matrix multiplication, where each output element is the scalar product of two vectors (a row of $A$ and a column of $B$), you take, for each output element, the elementwise sum of the log-row and the log-column and keep only its largest component.
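
Sketched in PyTorch (my notation; `logA` is $(M, K)$, `logB` is $(K, N)$):

```python
import torch

def maxplus_matmul(logA: torch.Tensor, logB: torch.Tensor) -> torch.Tensor:
    # tropical (max-plus) product: out[i, j] = max_k (logA[i, k] + logB[k, j]),
    # i.e. the log of the largest summand of (A @ B)[i, j]
    return (logA.unsqueeze(2) + logB.unsqueeze(0)).amax(dim=1)
```

This underestimates the true $\log\big((AB)_{ij}\big)$ by at most $\log K$ and needs no exp or log at all, though as written it materializes an $(M, K, N)$ intermediate; a real kernel would reduce over $K$ in tiles instead.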