Assembler instructions on 80xx platform

rsz · June 18, 2007, 1:30am

Hi,

I’ve got some questions which can (most likely) only be answered by someone at Nvidia. All are more or less assembler-related, so please don’t blame me for asking about stuff that is not supported so far…

(1) add-with-carry

Is there an add-with-carry instruction on the 80xx platform?

Some thoughts about adc:

I guess there would be mnemonics to set and get the carry as well?!
Which instructions change the carry? What happens if the thread is stopped before we save the carry? Or is the carry a (thread-owned) register? Just curious…

(2) mulhi, mul, etc

What happens when we call mul and mulhi? Does it compute the multiplication twice, or is there a way to tell the compiler to compute it once and fill two output registers?

(3) PTX assembler

When can we expect a PTX manual and use inline assembler? AFAIK the FAQ says something like “soon”.

Thanks,

Robert

Simon_Green · June 19, 2007, 12:00pm

The hardware does support integer add with carry, but I’m not sure if it’s exposed in PTX currently.
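Until a carry instruction is reachable from source code, the carry can be recovered in plain C by checking whether the 32-bit sum wrapped around. The following is only a sketch under that assumption: `add_with_carry`, `add64_limbs` and the `HD` macro are hypothetical names chosen for illustration, not part of CUDA or PTX, and nothing here says what instructions the compiler will actually emit.

```cpp
#include <cstdint>

// HD expands to CUDA qualifiers under nvcc and to nothing elsewhere,
// so the same sketch compiles as ordinary C++ for host-side testing.
#ifndef HD
#ifdef __CUDACC__
#define HD __host__ __device__
#else
#define HD
#endif
#endif

// Computes a + b + carry_in (carry_in must be 0 or 1).
// Returns the low 32 bits and writes the carry-out (0 or 1) to *carry_out.
HD inline uint32_t add_with_carry(uint32_t a, uint32_t b,
                                  uint32_t carry_in, uint32_t* carry_out)
{
    uint32_t partial = a + b;            // wraps modulo 2^32
    uint32_t c1 = partial < a;           // 1 if a + b wrapped
    uint32_t sum = partial + carry_in;   // may wrap only if partial == 0xFFFFFFFF
    uint32_t c2 = sum < partial;         // 1 if adding carry_in wrapped
    *carry_out = c1 | c2;                // c1 and c2 cannot both be 1
    return sum;
}

// Adds two 64-bit numbers stored as two 32-bit limbs (least significant first).
// The carry out of the top limb is discarded.
HD inline void add64_limbs(const uint32_t a[2], const uint32_t b[2],
                           uint32_t out[2])
{
    uint32_t carry = 0;
    out[0] = add_with_carry(a[0], b[0], carry, &carry);
    out[1] = add_with_carry(a[1], b[1], carry, &carry);
}
```

Because the carry lives in an ordinary variable here, it is per-thread state like any other register value, which sidesteps the question of what happens to a hardware carry flag when a thread is interrupted.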
There is a mul.wide instruction that multiplies two 32-bit operands and produces a 64-bit result.
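At the C level, the same "compute once, get both halves" idea can be written as a single 64-bit product that is then split, instead of calling a low multiply and a high multiply (CUDA's `__umulhi` intrinsic) separately. This is a sketch only: `mul_wide_u32`, `mul_lo_hi` and the `HD` macro are hypothetical names, and whether the compiler lowers the 64-bit product to one wide multiply is an assumption that should be checked in the generated PTX.

```cpp
#include <cstdint>

// HD expands to CUDA qualifiers under nvcc and to nothing elsewhere,
// so the same sketch compiles as ordinary C++ for host-side testing.
#ifndef HD
#ifdef __CUDACC__
#define HD __host__ __device__
#else
#define HD
#endif
#endif

// Full 32 x 32 -> 64-bit unsigned product.
// Widening one operand before the multiply keeps the upper 32 bits.
HD inline uint64_t mul_wide_u32(uint32_t a, uint32_t b)
{
    return static_cast<uint64_t>(a) * b;
}

// Computes the product once and splits it into low and high 32-bit halves.
HD inline void mul_lo_hi(uint32_t a, uint32_t b, uint32_t* lo, uint32_t* hi)
{
    uint64_t p = mul_wide_u32(a, b);
    *lo = static_cast<uint32_t>(p);        // low 32 bits (what mul gives)
    *hi = static_cast<uint32_t>(p >> 32);  // high 32 bits (what mulhi gives)
}
```

For example, `mul_lo_hi(0xFFFFFFFF, 0xFFFFFFFF, &lo, &hi)` yields `lo == 0x00000001` and `hi == 0xFFFFFFFE`, the two halves of 0xFFFFFFFE00000001.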
The PTX specification and assembler will be included in the CUDA 1.0 release.
