cublas_sdot bug ? sdot should be single precision

nvwalker · January 20, 2009, 9:56pm

cublas_sdot appears to deliver a double precision result. Shouldn’t this be a single proecision blas function ?
( cublas_ddot is the double precision function) .
System : CUDA 2.0 on Linux - RedHat EL5 x86_64.

Test code follows. Answer should be 204, but doesn’t work when cublas_sdot is declared real.

  program sdot_test
  implicit real (a-h,o-z)
  integer*4 size,dev_x
  double precision cublas_sdot

c real cublas_sdot
parameter (n=8,size=4)
dimension y(n),z(n)
call cublas_init()
do j=1,n
y(j)=float(j)
enddo
call cublas_Alloc(n,size,dev_x)
call cublas_Set_Vector(n,size,y,1,dev_x,1)
call cublas_Get_Vector(n,size,dev_x,1,z(1),1)
s0=cublas_sdot(n,dev_x,1,dev_x,1)
s1=sdot(n,y(1),1,y(1),1)
s2=sdot(n,z(1),1,z(1),1)
print *,‘s0,s1,s2’,s0,s1,s2
call cublas_free(dev_x)
stop
end

mfatica · January 21, 2009, 6:58pm

The problem is in the fortran.c wrapper, not in cublas.
You can easily modify the fortran.c and replace:
#if CUBLAS_FORTRAN_COMPILER==CUBLAS_G77
double CUBLAS_SDOT (const int *n, const float *x, const int *incx, float *y, const int *incy)
#else
float CUBLAS_SDOT (const int *n, const float *x, const int *incx, float *y, const int *incy)
#endif

removing the incorrect double declaration and leaving the correct single precision one:
float CUBLAS_SDOT (const int *n, const float *x, const int *incx, float *y, const int *incy)

will give you the correct results:

gcc -c fortran.c -I/usr/local/cuda/include
g95 --no-second-underscore sdot_test.f90 fortran.o -L/usr/local/cuda/lib -lcublas -lmkl -lguide -lpthread

./a.out ( with the real cublas_sdot declaration in your source code)

s0,s1,s2 204. 204. 204.

EDIT: The problem is due to the g77 calling convention:
Functions that return type default REAL actually return the C type
double, and functions that return type COMPLEX return the values via an
extra argument in the calling sequence that points to where to store the
return value.
If you are not using g77 but a more recent compiler (gfortran, g95, ifort, etc), you should change some
of the defines in the fortran.c file.

Topic		Replies	Views
sdot cuda vs cublas result differences Jetson AGX Xavier	7	769	October 18, 2021
cublas cublasSdot can't work with As described Documentation Deep Learning (Training & Inference)	0	518	August 29, 2018
cublasSdot_v2() gives different results when running on different GPU types, GPU-Accelerated Libraries cublas	3	787	July 5, 2022
Doubts about cublasZdotu GPU-Accelerated Libraries	3	1526	November 26, 2012
Issue when calling cublasDdot from within kernel GPU-Accelerated Libraries	7	1024	March 21, 2018
Must the result of cublasSdot be host variable? GPU-Accelerated Libraries	4	1364	October 20, 2016
Why is the cublasSdot() function slow in a certain section? CUDA Programming and Performance	1	422	June 30, 2018
Is this a BUG of CuBLAS output not consistent for each run CUDA Programming and Performance	8	3276	July 29, 2010
Cublas, sum and dot. Newbie question. CUDA Programming and Performance	6	5627	November 29, 2012
Certain CUBLAS operations return 0 when called from Fortran CUDA Programming and Performance	0	877	March 18, 2010

cublas_sdot bug ? sdot should be single precision

Related topics