I’m trying to hand-code a function to be used in my CUDA project in PTX. I need to create a function to add two numbers of arbitrary precision (with carry) using the addc.cc (Page 41, PTX ISA Documentation 1.2) assembler operation, since I see no way that this operation is exposed in a level higher than PTX.
Of my limited understanding of PTX so far, one of the best ways to do this is to write a skeleton device function code implementing a basic add operation, and then hand-tune the corresponding PTX file to change the required add operations to addc.cc obtained using nvvc ptx generation and then proceed with the compilation.
Surely there must be an easier way to do this? Is there no way to access add with carry in higher-level CUDA?