Using CuBlas from gfortran... need Help!

pierrot91 · June 19, 2017, 9:25am

Hi everybody! I want to use cuBLAS from gfortran, but I have run into some problems :-(
Following Fatica's lecture (a PPT presentation available on the web), I tried the SGEMM example in both its THUNKING and NON_THUNKING variants on a simple matrix multiplication. The THUNKING program works fine, but the NON_THUNKING one does not. It compiles without errors but does not give the right result for C = A.B (I get C = 0 instead of 1 in my program). Could someone help me?

Here is the program:

program ex_sgemm
implicit none
integer :: n,i
real, allocatable :: A(:,:), B(:,:), C(:,:)
integer*8 :: devPtrA, devPtrB, devPtrC
integer :: size_of_real = 16
call cublas_init()
! allocate and initialize the CPU matrices
write(6,*) ' enter n '
read(5,*) n
allocate( A(n,n), B(n,n), C(n,n) )
! transfer the data to the GPU
call cublas_alloc(n*n, size_of_real, devPtrA)
call cublas_alloc(n*n, size_of_real, devPtrB)
call cublas_alloc(n*n, size_of_real, devPtrC)
A = 1.0
B = 1.0 / float(n)
C = 0.0
call cublas_Set_Matrix(n,n, size_of_real, A, n, devPtrA, n)
call cublas_Set_Matrix(n,n, size_of_real, B, n, devPtrB, n)
call cublas_Set_Matrix(n,n, size_of_real, C, n, devPtrC, n)
! call the CUBLAS library
call cublas_Get_Matrix(n,n, size_of_real, C, n, devPtrC, n)
write(6,*) 'C retrieved '
do i = 1, 10
write(6,10) C(i,1:10)
enddo

call CUBLAS_SGEMM( 'n', 'n', n, n, n, 1.0, devPtrA, n, devPtrB, n, 1.0, devPtrC, n )

! copy the result back, GPU -> CPU
call cublas_Get_Matrix(n,n, size_of_real, C, n, devPtrC, n)
write(6,*) 'C computed '
do i = 1, 10
write(6,10) C(i,1:10)
enddo
deallocate( A, B, C)
call cublas_free(devPtrA)
call cublas_free(devPtrB)
call cublas_free(devPtrC)
10 format(10(2x,f10.5))
end program ex_sgemm

and the compile.sh:

nvcc -O3 -c /usr/local/cuda/src/fortran.c
gfortran -O3 *.o fortran_non_thunking.f90 -o toto_non \
  -L/usr/local/cuda/lib64 -lcudart -lcublas
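
One thing that can be checked independently of cuBLAS is the element size, since `cublas_alloc` and `cublas_Set_Matrix` take it in bytes. Here is a minimal sketch, assuming the Fortran 2008 `storage_size` intrinsic is available in the gfortran being used. The program name `check_real_size` and the variable `bytes_per_real` are only for illustration and are not part of the program above:

```fortran
program check_real_size
  implicit none
  real :: x                  ! default real kind, declared like A, B and C
  integer :: bytes_per_real  ! illustrative name, not from the original program

  ! storage_size returns the size of one element in bits
  bytes_per_real = storage_size(x) / 8
  write(6,*) 'bytes per default real = ', bytes_per_real
end program check_real_size
```

The value it prints can be compared with the `size_of_real` passed to the cuBLAS wrappers.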
