low cublas single precision accuracy cublasDgemv <-> cublasSgemv

tohahn · July 23, 2008, 2:01pm

Hello everybody!
First post, first problem:

To get started with CUDA I executed some performance tests of functions I need frequently, such as
sgemv:

I got perfectly accurate results using cublasDgemv (compared to a simple CPU implementation), but cublasSgemv creates huge errors:

This simple 3x3 example yields 435462304956416 for the second component on the GPU and 435462338510848 on the CPU. → abs Error = 33554432(!)

(17517372 8222629 16327114) (11549916)
(16646960 19007260 4118818) X (9953420)
( 6989178 16017092 5791423) (13111553)

This cannot possibly have been produced by the FMAD’s truncation?! (programming guide, p. 81)

Has anybody else observed this behaviour? (I used the forum search, didn’t find comparable results.)
If it’s because of the FMADs, is there a way to make cublas use__fadd_rn()/__fmul_rn()?

Any input will be appreciated.
Thanks in advance!

senorbum · July 23, 2008, 3:00pm

Hello everybody!

First post, first problem:

To get started with CUDA I executed some performance tests of functions I need frequently, such as

sgemv:

I got perfectly accurate results using cublasDgemv (compared to a simple CPU implementation), but cublasSgemv creates huge errors:

This simple 3x3 example yields 435462304956416 for the second component on the GPU and 435462338510848 on the CPU. → abs Error = 33554432(!)

(17517372 8222629 16327114) (11549916)

(16646960 19007260 4118818) X (9953420)

( 6989178 16017092 5791423) (13111553)

This cannot possibly have been produced by the FMAD’s truncation?! (programming guide, p. 81)

Has anybody else observed this behaviour? (I used the forum search, didn’t find comparable results.)

If it’s because of the FMADs, is there a way to make cublas use__fadd_rn()/__fmul_rn()?

Any input will be appreciated.

Thanks in advance!

[snapback]414792[/snapback]

Isn’t single precision only accurate to 10^-7 resulting in 7 significant figures? I could be wrong, but I thought this was the case. The figures are fine for this precision. It may seem like a lot for an error, but in terms of magnitude it seems to be about right.

tohahn · July 23, 2008, 3:14pm

Oops, your absolutely right. I must have gotten distracted by the large number…

Thanks!

senorbum · July 23, 2008, 3:38pm

No problem. This is something that would definitely confuse me if I was absorbed in the problem.

Topic		Replies	Views
cublas sgemv accuracy CUDA Programming and Performance	0	7032	May 12, 2007
Question regarding Precision Issues in BLAS CUDA Programming and Performance	9	8680	March 4, 2010
Significant difference in results between MKL-BLAS & CUBLAS different results in Cgemm CUDA Programming and Performance	9	5126	August 31, 2009
sgemm precision wrong results cublasSgemm vs MKL sgemm CUDA Programming and Performance	4	5442	December 22, 2007
cublasSgemm gives incorrect result with big matrix CUDA Programming and Performance cuda	0	405	June 26, 2020
cublas return different result with cpu CUDA Programming and Performance	1	1449	November 25, 2011
cublasSgemm gives incorrect result with big matrix CUDA Programming and Performance cuda	1	475	June 28, 2020
cublas problem with very big matrixes and cublasDgemm slow CUDA Programming and Performance	2	1062	February 23, 2017
Matlab mex file using cublas - problems CUDA Programming and Performance	13	9100	October 13, 2009
Matrix Multiplication by cublasSgemm CUDA Programming and Performance	1	7556	March 26, 2010

low cublas single precision accuracy cublasDgemv <-> cublasSgemv

Related topics