arbitrary precision arithmetic

kandan1976 · May 18, 2019, 6:43am

Hi,
Is there any CUDA library for arbitrary-precision arithmetic, for example for
multiplying numbers with 2 million decimal digits?

Thanks much,
mani

njuffa · May 18, 2019, 2:45pm

You might want to take a look at CAMPARY:

https://hal.archives-ouvertes.fr/hal-01312858
Mioara Joldes, Jean-Michel Muller, Valentina Popescu, Warwick Tucker: “CAMPARY: Cuda Multiple Precision Arithmetic Library and Applications”, 5th International Congress on Mathematical Software (ICMS), July 2016, Berlin, Germany

The best link to the software itself that I could find in a five-second search is http://homepages.laas.fr/mmjoldes/campary/, but by all means check the usual open software repositories as well.

An older project is CUMP:

https://github.com/skystar0227/CUMP
T. Nakayama and D. Takahashi: “Implementation of Multiple-Precision Floating-Point Arithmetic Library for GPU Computing”, Proc. 23rd IASTED International Conference on Parallel and Distributed Computing and Systems (PDCS 2011), pp. 343–349 (2011).

Even older is gpuprec:

https://github.com/lumianph/gpuprec
Mian Lu, Bingsheng He, and Qiong Luo: “Supporting extended precision on graphics processors”. DaMoN '10 Proceedings of the Sixth International Workshop on Data Management on New Hardware, June 2010, pp. 19-26.

I have not used any of the above.

kandan1976 · May 18, 2019, 3:40pm

Thanks a lot for your time.

Mani

cbuchner1 · May 19, 2019, 3:59pm

Directly from NVIDIA: https://github.com/NVlabs/xmp

2 million decimal digits is a lot. I am not sure there is a practical general-purpose library that can handle it.
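To put "a lot" into numbers, here is a rough sizing sketch. It is only arithmetic and assumes the operand is stored as an integer in 32-bit limbs; the helper names are made up for illustration and do not come from XMP or any other library mentioned in this thread.

```cpp
#include <cmath>
#include <cstdint>

// Bits needed to hold any integer with `digits` decimal digits:
// each decimal digit carries log2(10) ~ 3.32 bits.
std::uint64_t bits_for_decimal_digits(std::uint64_t digits) {
    return static_cast<std::uint64_t>(std::ceil(digits * std::log2(10.0)));
}

// Number of 32-bit limbs needed to store that many bits (rounded up).
std::uint64_t limbs32_for_bits(std::uint64_t bits) {
    return (bits + 31) / 32;
}
```

For 2,000,000 digits this gives 6,643,857 bits, i.e. 207,621 limbs or roughly 811 KiB per operand. A schoolbook multiplication of two such operands would need about 4.3 × 10^10 limb products, which is why sizes like this normally call for asymptotically faster multiplication algorithms.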

njuffa · May 19, 2019, 4:04pm

Interesting. I wasn’t aware of an effort by NVlabs to produce such a library. Is there a published paper available somewhere? Or at least a GTC presentation slide deck?

cbuchner1 · May 19, 2019, 4:45pm

Voilà

cbuchner1 · May 19, 2019, 4:47pm

This library is only optimized up to the Maxwell microarchitecture.

For best performance on Pascal, force it to multiply using XMAD; for Volta and Turing, force IMAD.

For peak performance on Pascal, stick with CUDA 8.0; Volta and Turing can use the later CUDA releases.
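Some background on why the multiply instruction matters, as a host-side sketch. It assumes that XMAD is the 16-bit multiply-add building block and IMAD the 32-bit one; that reading is not spelled out in this thread. The function names are illustrative only and this is not how XMP is written or configured.

```cpp
#include <cstdint>

// 32x32 -> 64-bit product from a single wide multiply
// (the kind of step a native 32-bit multiply-add provides).
std::uint64_t mul_wide_32(std::uint32_t a, std::uint32_t b) {
    return static_cast<std::uint64_t>(a) * b;
}

// The same product assembled from four 16x16 partial products
// (the kind of decomposition needed when only 16-bit multiplies are fast).
std::uint64_t mul_wide_16(std::uint32_t a, std::uint32_t b) {
    const std::uint64_t a_lo = a & 0xFFFFu, a_hi = a >> 16;
    const std::uint64_t b_lo = b & 0xFFFFu, b_hi = b >> 16;
    const std::uint64_t lo_lo = a_lo * b_lo;  // weight 2^0
    const std::uint64_t lo_hi = a_lo * b_hi;  // weight 2^16
    const std::uint64_t hi_lo = a_hi * b_lo;  // weight 2^16
    const std::uint64_t hi_hi = a_hi * b_hi;  // weight 2^32
    return lo_lo + ((lo_hi + hi_lo) << 16) + (hi_hi << 32);
}
```

Both functions return the same value for every input; the difference is how many multiply steps each limb product costs, which is what makes the choice architecture-dependent.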

Robert_Crovella · May 19, 2019, 7:52pm

XMP paper:

http://www.acsel-lab.com/arithmetic/arith23/data/1616a047.pdf

A_F · December 19, 2020, 8:47am

Hi,

Do you know a place where I can find documentation about CAMPARY?
I was not able to find a guide or neat examples.

The problem I want to solve requires arrays of numbers with higher precision than double (multi-precision). I need to work with them continuously on the CPU, transfer them to the GPU, transfer the results back from the GPU to the CPU, and so on, always preserving the precision.

The only operations I use are +, -, *, and /, which CAMPARY implements, but I do not know how to declare the arrays on the CPU and GPU or how to transfer them between the two in either direction while preserving the precision. A basic example would be great!

Thanks

njuffa · December 19, 2020, 9:19am

I was under the impression that CAMPARY is open-source software. If so, “Use the source, Luke!”

A_F · December 19, 2020, 10:10am

Thanks for the suggestion.
The problem is that the source:

does not provide information, just the “.h” files.

A basic example (CPU-GPU transfer of multi-precision arrays that preserves the precision) would meet my needs without diving into the code.

njuffa · December 19, 2020, 8:08pm

Yes, “use the source” means diving into the code. Not every open-source project comes with docs and/or neat examples. CAMPARY is a header-file library so the entire source is in those .h files.

A quick look at the header files indicates that the multi-precision types are simply arrays of doubles (e.g. quadruple precision: four doubles), so multi-precision operands can be copied trivially between CPU and GPU.
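A minimal host-side sketch of what "copied trivially" means. `MultiDouble` is a hypothetical stand-in, not CAMPARY's actual type, and `std::memcpy` stands in for the transfer; in a CUDA program the same pointer and byte count would be passed to `cudaMemcpy` with `cudaMemcpyHostToDevice` or `cudaMemcpyDeviceToHost`.

```cpp
#include <cstddef>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical stand-in for a multi-precision value stored as N doubles.
// The value it represents is the unevaluated sum of its terms.
template <std::size_t N>
struct MultiDouble {
    double term[N];
};

using Quad = MultiDouble<4>;  // four doubles per value

// A plain array of doubles can be copied byte for byte.
static_assert(std::is_trivially_copyable<Quad>::value, "byte-wise copy is safe");
static_assert(sizeof(Quad) == 4 * sizeof(double), "no padding between terms");

// Copy a whole array byte-wise. With CUDA, the destination would be a
// device buffer from cudaMalloc and std::memcpy would become cudaMemcpy.
std::vector<Quad> copy_bytes(const std::vector<Quad>& src) {
    std::vector<Quad> dst(src.size());
    if (!src.empty()) {
        std::memcpy(dst.data(), src.data(), src.size() * sizeof(Quad));
    }
    return dst;
}

// True if both arrays hold bit-identical terms, i.e. no precision was lost.
bool bit_identical(const std::vector<Quad>& a, const std::vector<Quad>& b) {
    if (a.size() != b.size()) return false;
    return a.empty() ||
           std::memcmp(a.data(), b.data(), a.size() * sizeof(Quad)) == 0;
}
```

Because the copy moves raw bytes and never converts or rounds anything, every term arrives unchanged, so the precision is preserved in both directions. The buffer size is simply the element count times `sizeof(Quad)`.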

Robert_Crovella · December 24, 2020, 3:21pm

This may be of interest.

jeanmonet · February 12, 2021, 7:29pm

CGBN seems to be a good solution: https://github.com/NVlabs/CGBN
However, it has not been updated for 2 years and does not support the Turing architecture and later. Do the maintainers intend to add support for the latest architectures?

camilo.nunezf · June 11, 2021, 3:38pm

I used the library for my project on an Ampere architecture GPU and I didn’t have any problems. Just change the -arch flag in your nvcc command. I used -arch=sm_80 for the Ampere architecture. Try updating the Makefiles with this and test on your device.

For example, in the makefile for samples/sample_01_add I used this:

nvcc $(INC) $(LIB) -I../../include -arch=sm_80 add.cu -o add -lgmp

al.lvov777 · June 6, 2023, 1:17pm

Hey there, everyone! I have a question about arbitrary-precision arithmetic in the Windows ecosystem. Which library can be used easily on Windows? I’ve tried to use the NVlabs CGBN library, but it fails to build with the MSVC toolchain. Has anyone succeeded in using arbitrary-precision arithmetic in CUDA on Windows?
