Wrong results given by the GPU: the classical matrix multiplication using shared memory outputs wrong results

I’ve tried to implement the classical matrix multiplication method that uses shared memory on a GTS 250, but the GPU seems to output results that differ slightly (sometimes by a lot) from what the CPU code computes.
I’ve tried the same code in emulation mode (using -deviceemu) and the results were 100% correct.
Does that mean there is something wrong with the device?
What could be the reason?
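
For reference, this is a minimal sketch of the kind of tiled shared-memory kernel I mean (TILE, matMulShared and the exact indexing are illustrative, not my actual code; it assumes square float matrices whose dimension is a multiple of the tile size):

```
#define TILE 16

// C = A * B for square N x N float matrices, N assumed to be a multiple of TILE.
__global__ void matMulShared(const float *A, const float *B, float *C, int N)
{
    __shared__ float As[TILE][TILE];
    __shared__ float Bs[TILE][TILE];

    int row = blockIdx.y * TILE + threadIdx.y;
    int col = blockIdx.x * TILE + threadIdx.x;

    float acc = 0.0f;

    for (int t = 0; t < N / TILE; ++t) {
        // Each thread loads one element of the current A tile and one of the B tile.
        As[threadIdx.y][threadIdx.x] = A[row * N + t * TILE + threadIdx.x];
        Bs[threadIdx.y][threadIdx.x] = B[(t * TILE + threadIdx.y) * N + col];
        __syncthreads();   // wait until the whole tile is loaded before reading it

        for (int k = 0; k < TILE; ++k)
            acc += As[threadIdx.y][k] * Bs[k][threadIdx.x];
        __syncthreads();   // wait before the next iteration overwrites the tiles
    }

    C[row * N + col] = acc;
}
```

It is launched with a dim3 block of (TILE, TILE) and a grid of (N/TILE, N/TILE).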

Does nothing explain the errors I’m getting?
Could it be that in emulation I didn’t specify the GPU version (GTS 250), so it didn’t take the real amount of shared memory (or something like that) into account?
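
For what it’s worth, this is roughly how I compare the two result arrays (compareResults is just my own helper and the tolerance value is arbitrary; the real code may differ):

```
#include <math.h>
#include <stdio.h>

// Compare GPU and CPU results with a relative tolerance instead of exact equality,
// since the two sides may accumulate the float sums in a different order.
bool compareResults(const float *gpu, const float *cpu, int n, float relTol)
{
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        float diff = fabsf(gpu[i] - cpu[i]);
        if (diff > relTol * fmaxf(fabsf(cpu[i]), 1.0f)) {
            if (mismatches < 10)   // print only the first few offenders
                printf("i=%d  gpu=%f  cpu=%f  diff=%f\n", i, gpu[i], cpu[i], diff);
            ++mismatches;
        }
    }
    printf("%d of %d elements outside tolerance %g\n", mismatches, n, relTol);
    return mismatches == 0;
}
```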

What data type are you using?

Data type: float everywhere (CPU and GPU).
I’d like to know whether there is some way to specify the amount of shared memory per block in the simulator, since it can’t know that I’m actually using a GTS 250.
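
In the meantime, this is the quick check I can run on the real card to see what it actually reports (just a sketch; it only prints a few fields of cudaDeviceProp for device 0):

```
#include <stdio.h>
#include <cuda_runtime.h>

int main()
{
    cudaDeviceProp prop;
    cudaError_t err = cudaGetDeviceProperties(&prop, 0);   // device 0
    if (err != cudaSuccess) {
        printf("cudaGetDeviceProperties failed: %s\n", cudaGetErrorString(err));
        return 1;
    }
    printf("Device            : %s\n", prop.name);
    printf("Compute capability: %d.%d\n", prop.major, prop.minor);
    printf("Shared mem / block: %lu bytes\n", (unsigned long)prop.sharedMemPerBlock);
    printf("Max threads/block : %d\n", prop.maxThreadsPerBlock);
    return 0;
}
```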

Thanks
