Calculation differences between CUDA and MATLAB

I’ve implemented matrix inverse calculation in CUDA. I tried it with the CULA and with the CUBLAS libraries. I computed the same data in Matlab too. After the calculation I compared the results and realized that the difference between Matlab and CUDA is huge, and it doesn’t matter whether I use the CULA or the CUBLAS functions. The relative error can reach 1000%. I can’t understand why… The matrix isn’t too big, just 111×111. Does anybody have an idea why? I’d appreciate it.

Assuming this is not a mixup of column-major and row-major storage, this may also happen if your matrix is close to singular. Iterative refinement is a way to mitigate the problem. See also the Numerical Recipes section on iterative improvement.
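A minimal sketch of the iterative refinement idea, using NumPy as a stand-in for the GPU solver (the single-precision solve plays the role of the CUDA result; everything here is illustrative, not the poster's code):

```python
import numpy as np

def refine(A, b, iters=3):
    """Solve A x = b with a single-precision solver, then refine in double."""
    A32 = A.astype(np.float32)
    # Initial solve in single precision (the "GPU" step).
    x = np.linalg.solve(A32, b.astype(np.float32)).astype(np.float64)
    for _ in range(iters):
        # Residual computed in double precision.
        r = b - A @ x
        # Correction solved with the same low-precision solver.
        d = np.linalg.solve(A32, r.astype(np.float32)).astype(np.float64)
        x += d
    return x
```

Each iteration shrinks the error by roughly a factor of cond(A)·eps_single, so for matrices that aren't too close to singular a few iterations recover near double-precision accuracy from a single-precision factorization.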

Thanks. My matrix is symmetric so I can’t mix up the storage, but the second piece of advice might work. I will try it and let you know.

Jacket uses both CUBLAS and CULA for various operations and the results identically match the CPU output. So, what you are trying to do should definitely work. You can try Jacket free for 15 days if that’ll help: http://accelereyes.com

Are you working with the same precision in both cases, or one in double and one in single? Matlab usually uses double precision, and even if you try to use single you need to be very careful that double doesn’t sneak up on you. The difference in accuracy is big: single gives about 7 significant digits while double gives around 15. Using different precisions is enough that a 1000% relative difference isn’t so surprising.

If both are the same precision then you probably have issues with singularity.
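To see how both effects combine, here is an illustrative NumPy experiment (assumed setup, not the original code): inverting a symmetric matrix with condition number around 1e8 in single vs. double precision. Since single precision carries only ~7 digits, a condition number near 1e8 wipes out essentially all accuracy in the inverse:

```python
import numpy as np

n = 100
rng = np.random.default_rng(1)
# Symmetric matrix with eigenvalues from 1 down to 1e-8 (cond ~ 1e8).
Q, _ = np.linalg.qr(rng.standard_normal((n, n)))
eigs = np.logspace(0, -8, n)
A = (Q * eigs) @ Q.T

inv64 = np.linalg.inv(A)
inv32 = np.linalg.inv(A.astype(np.float32)).astype(np.float64)

rel_err = np.linalg.norm(inv32 - inv64) / np.linalg.norm(inv64)
print(rel_err)  # large: single precision cannot resolve a cond ~1e8 matrix
```

If `cond(A)` (computable with `np.linalg.cond`, or `cond` in Matlab) times the machine epsilon of the working precision approaches 1, the computed inverse is meaningless, regardless of which library produced it.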

Or even better, try the GPU functionality already in Matlab (Parallel Computing Toolbox): http://www.mathworks.se/discovery/matlab-gpu.html

I heard that it is pretty good :)

Although I have very little experience with Jacket (played a bit with a trial license), I can tell you that the Parallel Computing Toolbox is much more limited than Jacket. Subscripting only arrived in Matlab 2011a, and it still doesn’t support bsxfun. Matrix multiply times are also a bit slower.