strange things of cublas 3.0 on RHEL 5.3

shadowperi · April 8, 2010, 2:10pm

I’ve just installed cuda 3.0 and want to test a simple program. I called cublas_sgemm to multiply two 22 matrices(B=AA), which turned out that the routine did nothing. I mean the output matrix B is just the same as the input matrix A. It’s strange! Previously I tried cublas 2.3 on ubuntu 8.04, and this test passed through.

Then I wondered whether or not it caused from the version. Then I tested cuda 2.3 of RHEL 5.3. This time the program leaded to segmentation fault…

Anyone knows the reason. I’m going insane… thanks…

[codebox] program matrixmod

implicit none

integer M, N

parameter (M=2, N=2)

real*4 a(M,N),b(M,N),c(M,N)

integer i, j

do j = 1, N

do i = 1, M

a(i,j) = (i-1) * M + j

enddo

enddo

do j = 1, N

do i = 1, M

b(i,j) = (i-1) * M + j

enddo

enddo

call cublas_sgemm('N','N',2,2,2,1.0,

 &        a,2,a,2,0.0,b,2)

do j = 1, N

do i = 1, M

write(*,"(F7.0$)") b(i,j)

enddo

write (*,*) ""

enddo

write (*,*) ""

do j = 1, N

do i = 1, M

write(*,"(F7.0$)") a(i,j)

enddo

write (*,*) ""

enddo

stop

end[/codebox]

shadowperi · April 10, 2010, 4:11pm

further information:
If I manually allocate device memory by calling cublas_alloc, it leaded to
“device memory allocation failed”, which didn’t occur on ubuntu with cuda 2.3.

philippev · April 14, 2010, 9:03pm

I’ve just installed cuda 3.0 and want to test a simple program. I called cublas_sgemm to multiply two 22 matrices(B=AA), which turned out that the routine did nothing. I mean the output matrix B is just the same as the input matrix A. It’s strange! Previously I tried cublas 2.3 on ubuntu 8.04, and this test passed through.

Then I wondered whether or not it caused from the version. Then I tested cuda 2.3 of RHEL 5.3. This time the program leaded to segmentation fault…

Anyone knows the reason. I’m going insane… thanks…

[codebox] program matrixmod
implicit none

integer M, N

parameter (M=2, N=2)

real*4 a(M,N),b(M,N),c(M,N)

integer i, j

do j = 1, N

do i = 1, M

a(i,j) = (i-1) * M + j

enddo

enddo

do j = 1, N

do i = 1, M

b(i,j) = (i-1) * M + j

enddo

enddo

call cublas_sgemm('N','N',2,2,2,1.0,

 &        a,2,a,2,0.0,b,2)

do j = 1, N

do i = 1, M

write(*,"(F7.0$)") b(i,j)

enddo

write (*,*) ""

enddo

write (*,*) ""

do j = 1, N

do i = 1, M

write(*,"(F7.0$)") a(i,j)

enddo

write (*,*) ""

enddo

stop

end[/codebox]

Are you using the fortran thunking interface or the regular interface ?

in cublas 3.0, the fortran.c has been split into 2 parts : fortran.[c,h] (for the regular interface) and fortran_thunking.[c,h]

If you do not the device allocation yourself, you should use the thunking interface.

Topic		Replies	Views
strange things of cublas 3.0 on RHEL 5.3 CUDA Programming and Performance	0	2683	April 8, 2010
strange things of cublas 3.0 on RHEL 5.3 CUDA Programming and Performance	0	1130	April 8, 2010
Cublas_status_execution_failed GPU-Accelerated Libraries	2	10678	February 23, 2021
A newbie question on cublasSgemm CUDA Programming and Performance	6	4878	May 14, 2008
cgemm operation returns wrong result Error in C Code? CUDA Programming and Performance	8	1697	August 25, 2009
NVBLAS cublasXtZgemm failed with error 3/8 GPU-Accelerated Libraries	4	2016	July 17, 2015
cuBLAS fails when matrix has more than 2^31-1 entries? CUDA Programming and Performance	13	726	October 12, 2021
cublas - cublasSgemm - problem CUDA Programming and Performance	2	2110	March 16, 2010
Cublas sgemm pointer error? Query re error in output of matrix multiplication. CUDA Programming and Performance	5	3402	February 18, 2010
How to use CUBLAS in C ? CUDA Programming and Performance	1	962	August 1, 2011

strange things of cublas 3.0 on RHEL 5.3

Related topics