What do the extra predicate register operands mean in an IMAD instruction?

SparkHu · October 9, 2022, 8:23am

I’m reading SASS code in Nsight Compute and found an instruction like this:

  IMAD.WIDE.U32.X R72, P3, R35, R13, R58, P3

I know the PTX instruction madc. It has only four operands (three inputs and one output), but the SASS instruction above has six operands (four general-purpose registers and two predicate registers). What are these predicate registers used for in the instruction?
I have read the CUDA Binary Utilities documentation but found no explanation of this instruction. Could you explain what it does?
Best Regards.

njuffa · October 9, 2022, 8:56am

NVIDIA does not explain SASS instructions to this level of detail, a stance they have maintained for 15 years, so that is unlikely to change. If you really must know, you will need to spend some quality time reverse engineering the details by looking at lots of SASS with various IMAD flavors.

This is an IMAD with the .X suffix, so it is used in some sort of extended-precision computation. A wild guess could be that the two predicate registers specify where the carry-out is written and where the carry-in is read from. Passing the carry through designated predicate registers allows multiple chains of carry-based dependencies to be alive at the same time; a generalization of the x86-64 ADOX / ADCX scheme, which allows for two such dependency chains, if you will.

As I said, a wild guess only.
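
To make the guess concrete, here is a minimal host-side C++ sketch of what such an instruction might compute. Everything in it is an assumption: the names `MadResult` and `imad_wide_u32_x` are hypothetical, and the semantics (a 32x32-bit product plus a 64-bit addend plus a one-bit carry-in, with the carry out of bit 63 returned separately) are only the interpretation guessed at above, not documented SASS behavior.

```cpp
#include <cstdint>

// Hypothetical model of the guessed semantics, NOT documented SASS behavior.
struct MadResult {
    uint64_t value;     // 64-bit result (assumed to land in a register pair)
    bool     carry_out; // carry assumed to be written to the destination predicate
};

// Computes a * b + c + carry_in and reports the carry out of bit 63.
// The carry-in is assumed to come from the trailing predicate operand.
inline MadResult imad_wide_u32_x(uint32_t a, uint32_t b, uint64_t c, bool carry_in) {
    // A 32x32-bit product always fits in 64 bits, so this step cannot overflow.
    uint64_t product = static_cast<uint64_t>(a) * b;

    // Add the 64-bit addend; unsigned wraparound signals a carry.
    uint64_t sum = product + c;
    bool carry = sum < product;

    // Add the incoming carry bit; this can wrap only if the previous add did not.
    uint64_t result = sum + (carry_in ? 1u : 0u);
    carry = carry || (result < sum);

    return {result, carry};
}
```

Under this model, each carry chain would thread its own `bool` from one call to the next, which is how several independent chains could be in flight at once if each used a different predicate register.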

SparkHu · October 27, 2022, 2:39am

I think your guess is reasonable. Thank you.

