Double precision emulation: implementing double precision in CUDA

egaburov · February 13, 2008, 11:38pm

Ladies & Gentlemen around here,

I am currently interested in implementing double precision emulation in CUDA; not fully, but limited to subtraction only. That is, I have two double precision numbers, x and y, and I need their difference z = x - y. The numbers x and y must be in double precision, but the difference can be stored in single precision. This way, I do not lose significant digits in z.

As we all know, CUDA currently does not support double precision arithmetic, but for some scientific calculations it is crucial to perform certain operations, especially differences between large numbers, in double precision; otherwise, the result will have a very large error! Fortunately, the rest of the code can remain in single precision, and most of the time these differences can be stored as single precision numbers (though the difference itself must be computed in double precision, and the variables being subtracted must also be held in double precision). The performance impact should be small as long as the CUDA kernel is bandwidth bound.
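To make the cancellation problem concrete, here is a minimal host-side sketch. The function names `diff_single` and `diff_double_then_round` are made up for illustration; the code is plain C++, so it also compiles as host code under nvcc.

```cpp
// Difference computed entirely in single precision: both inputs are rounded
// to float first, so anything beyond roughly 7 significant digits is lost
// before the subtraction even happens.
inline float diff_single(double x, double y) {
    return static_cast<float>(x) - static_cast<float>(y);
}

// Difference computed in double precision and only then rounded to float.
// This is the behaviour the emulation is supposed to reproduce.
inline float diff_double_then_round(double x, double y) {
    return static_cast<float>(x - y);
}
```

With x = 123456789.25 and y = 123456789.0, both inputs round to the same float (floats near 1.2e8 are spaced 8 apart), so `diff_single` returns 0 while `diff_double_then_round` returns 0.25.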

I am aware of the SoftFloat library (Berkeley SoftFloat) for CPUs, and I was wondering whether anybody has experience with emulating double precision in CUDA.

I don’t have much more to add, but any comments & suggestions are welcome.

Cheers,
Evgheni

wildcat4096 · February 13, 2008, 11:44pm

Does the Mandelbrot example included with CUDA implement some basic double precision operations?

egaburov · February 14, 2008, 12:01am

Wow, that was fast!

I’ve checked the Mandelbrot example, and it appears that some calculations there are done in double precision, including subtraction. I’ll see how I can adapt this to my application.

Thanks!

Ev.

seibert · February 14, 2008, 12:57am

A good place to look for pseudo-double precision algorithms is the dsfun90 library:

http://crd.lbl.gov/~dhbailey/mpdist/

The representation used in that library is one which represents a double as the sum of two single precision numbers with different exponents. In base 10, this would be like representing 1.00000001 as (1e0 + 1e-8). The addition algorithm is probably the easiest to implement.
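A rough sketch of that two-float idea, for illustration only. The names `dsfloat`, `ds_from_double`, `two_sum`, `ds_add` and `ds_sub_to_float` are hypothetical and are not taken from dsfun90 or from the Mandelbrot sample; the `DS_HD` macro is just an assumption that lets the same file build as plain C++ or with nvcc. It also assumes the compiler does not reassociate floating point operations (no fast-math style flags), since that would optimize the error terms away.

```cpp
#ifdef __CUDACC__
#define DS_HD __host__ __device__
#else
#define DS_HD
#endif

// Hypothetical "double-single" value: the number represented is hi + lo,
// with lo much smaller in magnitude than hi.
struct dsfloat { float hi; float lo; };

// Host-side split of a double into two floats (the device is assumed to
// have no double support, so this runs on the CPU before upload).
inline dsfloat ds_from_double(double x) {
    dsfloat r;
    r.hi = static_cast<float>(x);                             // leading digits
    r.lo = static_cast<float>(x - static_cast<double>(r.hi)); // what hi missed
    return r;
}

// Knuth's two-sum: s is the rounded sum of a and b, e is the rounding error,
// so that s + e equals a + b exactly (barring overflow).
DS_HD inline void two_sum(float a, float b, float &s, float &e) {
    s = a + b;
    float bb = s - a;
    e = (a - (s - bb)) + (b - bb);
}

// Double-single addition: add the high parts with error tracking, fold in
// the low parts, then renormalize so that lo is small relative to hi again.
DS_HD inline dsfloat ds_add(dsfloat a, dsfloat b) {
    float s, e;
    two_sum(a.hi, b.hi, s, e);
    e += a.lo + b.lo;
    dsfloat r;
    r.hi = s + e;
    r.lo = e - (r.hi - s);
    return r;
}

// z = x - y, returned as a single float (only the high part is kept).
DS_HD inline float ds_sub_to_float(dsfloat x, dsfloat y) {
    dsfloat neg_y = { -y.hi, -y.lo };
    return ds_add(x, neg_y).hi;
}
```

In this sketch, adding `ds_from_double(1.0)` and `ds_from_double(1e-8)` gives hi = 1 and lo of about 1e-8, which is the (1e0 + 1e-8) picture from the post, and `ds_sub_to_float` applied to 1.00000001 and 1.0 returns about 1e-8 where a plain float subtraction would return 0.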

egaburov · February 14, 2008, 8:17am

Thanks all for the replies! It seems I have enough information to look into. My interests, however, also lie in emulating extended (80-bit) and quad (128-bit) precision, but I guess this will be a step after 64-bit precision emulation.

Cheers,
Evghenii
