CUBLAS Sgemm Wrong results

instigatorAyaz · March 31, 2013, 11:10am

I am trying to use CUBLASSgemm for matrix multiplication but it gives me wrong result.
I try this in two ways. The first method gives wrong results while the other gives correct but transposed result. Please tell me the right way to use cublasSgemm method to get correct (non-transposed) result for matrix multiplication.
I using it as follows:

cublasSgemm( ‘N’, ‘N’, N, N, N, alpha, d_A, N, d_B, N, beta, d_C, N );

Output:
Matrix A
0.00 3.00 1.00 3.00 1.00 4.00 3.00 0.00
4.00 4.00 4.00 1.00 4.00 3.00 3.00 3.00
0.00 3.00 3.00 3.00 3.00 1.00 2.00 1.00
2.00 1.00 3.00 2.00 3.00 4.00 3.00 4.00
2.00 4.00 2.00 4.00 0.00 0.00 1.00 4.00
1.00 1.00 2.00 2.00 1.00 1.00 0.00 3.00
4.00 0.00 1.00 4.00 1.00 0.00 0.00 0.00
2.00 0.00 4.00 2.00 4.00 2.00 1.00 3.00
Matrix B
1.00 0.00 4.00 1.00 2.00 1.00 0.00 0.00
2.00 3.00 2.00 0.00 1.00 4.00 3.00 2.00
1.00 2.00 1.00 2.00 4.00 3.00 4.00 1.00
3.00 3.00 1.00 4.00 0.00 2.00 2.00 3.00
0.00 4.00 4.00 2.00 0.00 2.00 3.00 4.00
2.00 2.00 4.00 0.00 4.00 4.00 2.00 0.00
1.00 0.00 0.00 1.00 3.00 4.00 4.00 3.00
0.00 0.00 2.00 2.00 0.00 1.00 1.00 0.00
CUBLAS Version = 5000
CUBLAS SGEMM Elapsed Time: 0.000044 s, GFLOPS = 0.023256
Matrix Cublas C
7.00 25.00 22.00 27.00 17.00 13.00 16.00 19.00
34.00 32.00 41.00 43.00 35.00 27.00 22.00 33.00
41.00 35.00 40.00 52.00 29.00 24.00 22.00 43.00
36.00 30.00 48.00 41.00 46.00 46.00 35.00 41.00
42.00 32.00 57.00 44.00 55.00 34.00 30.00 42.00
28.00 46.00 40.00 52.00 28.00 22.00 24.00 38.00
34.00 20.00 34.00 47.00 24.00 18.00 12.00 37.00
9.00 9.00 15.00 16.00 14.00 11.00 10.00 13.00
Matrix Host C
27.00 32.00 30.00 19.00 32.00 51.00 42.00 29.00
28.00 45.00 63.00 33.00 49.00 69.00 63.00 40.00
22.00 38.00 30.00 28.00 25.00 46.00 47.00 36.00
24.00 35.00 51.00 33.00 42.00 57.00 52.00 32.00
25.00 28.00 30.00 31.00 19.00 40.00 36.00 25.00
13.00 19.00 24.00 21.00 15.00 24.00 23.00 14.00
17.00 18.00 25.00 24.00 12.00 17.00 15.00 17.00
17.00 34.00 44.00 33.00 31.00 41.00 43.00 29.00

If I use it as follows then it gives correct results but transposed.

cublasSgemm( ‘T’, ‘T’, N, N, N, alpha, d_A, N, d_B, N, beta, d_C, N );

Output:
Using Matrix Sizes: (8 x 8)

Matrix A
0.00 3.00 1.00 3.00 1.00 4.00 3.00 0.00
4.00 4.00 4.00 1.00 4.00 3.00 3.00 3.00
0.00 3.00 3.00 3.00 3.00 1.00 2.00 1.00
2.00 1.00 3.00 2.00 3.00 4.00 3.00 4.00
2.00 4.00 2.00 4.00 0.00 0.00 1.00 4.00
1.00 1.00 2.00 2.00 1.00 1.00 0.00 3.00
4.00 0.00 1.00 4.00 1.00 0.00 0.00 0.00
2.00 0.00 4.00 2.00 4.00 2.00 1.00 3.00
Matrix B
1.00 0.00 4.00 1.00 2.00 1.00 0.00 0.00
2.00 3.00 2.00 0.00 1.00 4.00 3.00 2.00
1.00 2.00 1.00 2.00 4.00 3.00 4.00 1.00
3.00 3.00 1.00 4.00 0.00 2.00 2.00 3.00
0.00 4.00 4.00 2.00 0.00 2.00 3.00 4.00
2.00 2.00 4.00 0.00 4.00 4.00 2.00 0.00
1.00 0.00 0.00 1.00 3.00 4.00 4.00 3.00
0.00 0.00 2.00 2.00 0.00 1.00 1.00 0.00
CUBLAS Version = 5000
CUBLAS SGEMM Elapsed Time: 0.000035 s, GFLOPS = 0.029331
Matrix Cublas C
27.00 28.00 22.00 24.00 25.00 13.00 17.00 17.00
32.00 45.00 38.00 35.00 28.00 19.00 18.00 34.00
30.00 63.00 30.00 51.00 30.00 24.00 25.00 44.00
19.00 33.00 28.00 33.00 31.00 21.00 24.00 33.00
32.00 49.00 25.00 42.00 19.00 15.00 12.00 31.00
51.00 69.00 46.00 57.00 40.00 24.00 17.00 41.00
42.00 63.00 47.00 52.00 36.00 23.00 15.00 43.00
29.00 40.00 36.00 32.00 25.00 14.00 17.00 29.00
Matrix Host C
27.00 32.00 30.00 19.00 32.00 51.00 42.00 29.00
28.00 45.00 63.00 33.00 49.00 69.00 63.00 40.00
22.00 38.00 30.00 28.00 25.00 46.00 47.00 36.00
24.00 35.00 51.00 33.00 42.00 57.00 52.00 32.00
25.00 28.00 30.00 31.00 19.00 40.00 36.00 25.00
13.00 19.00 24.00 21.00 15.00 24.00 23.00 14.00
17.00 18.00 25.00 24.00 12.00 17.00 15.00 17.00
17.00 34.00 44.00 33.00 31.00 41.00 43.00 29.00

CudaaduC · March 31, 2013, 5:39pm

Again this question… Linear Algebra libraries use COLUMN MAJOR format! The results are not wrong, rather the parameters you put into the Sgemm call are wrong.

Read the documentation for cuBLAS, see this thread

[url]cublasSgemm_v2 returns incorrect Matrix Multiplication results - GPU-Accelerated Libraries - NVIDIA Developer Forums

instigatorAyaz · April 1, 2013, 2:55am

Thanks for the reply.
OK, i was using it incorrectly. Now, it is working fine if I switch the parameters A and B to be handled has column major.

cublasSgemm( ‘N’, ‘N’, N, N, N, alpha, d_B, N, d_A, N, beta, d_C, N );

Sorry for any inconvenience.

CudaaduC · April 1, 2013, 6:29am

Not a problem, just always look at the documentation for the library. Coming from a comp sci background you probably were expecting row-major.
Getting used to these issues is worth the effort given the speed of *gemm operations.

Topic		Replies	Views
cublasSgemm wrong return value CUDA Programming and Performance	15	6442	November 12, 2010
cublasSgemm_v2 returns incorrect Matrix Multiplication results GPU-Accelerated Libraries	1	3489	March 15, 2013
Matrix multiplication with CublasSgemm CUDA Programming and Performance	3	8871	January 13, 2013
beginner CUBLAS Sgemm question CUDA Programming and Performance	2	1778	March 9, 2010
How to use cublasSgemm ? CUDA Programming and Performance	3	14355	November 16, 2018
cublasSgemm error? CUDA Programming and Performance	2	5322	July 12, 2007
Question in using Sgemm CUDA Programming and Performance	2	819	May 18, 2011
cublasSgemm - how to correctly use transpose? CUDA Programming and Performance	0	3502	October 13, 2008
CUBLAS, simple product of matrix CUDA Programming and Performance	3	2293	July 17, 2008
cgemm operation returns wrong result Error in C Code? CUDA Programming and Performance	8	1805	August 25, 2009

CUBLAS Sgemm Wrong results

Related topics