why shift is slower than integer multiply shift ,integer multiply

azaonline July 1, 2010, 11:00am 21

“mad24” is multiply-add instruction for integer on CUDA?

there are lots of “a*b+c”(int a,b,c) in my code. but why I can’t find “mad24” on its ptx?

my Gpu is G260.

How can I employ multiply-add operation of integer on CUDA to speedup my app?

Topic		Replies	Views
Measurements of different CUDA operator throughputs CUDA Programming and Performance	32	49880	August 24, 2009
32-bit number multiplication CUDA Programming and Performance	23	20447	July 1, 2012
Cuda 3.5 Integer Multiply Performance Is it really 3x slower than 64-bit floating point? CUDA Programming and Performance	21	19889	March 12, 2014
Bug with integer division? CUDA Programming and Performance	33	9327	September 9, 2015
matrix multiply reduction CUDA Programming and Performance	41	35539	January 15, 2011
CUDA, more threads for same work = Longer run time despite better occupancy, Why? CUDA Programming and Performance	9	6002	March 25, 2010
Mythical Tflops CUDA Programming and Performance	11	1080	January 14, 2019
GPU/CPU precision comparison and Kernel instructions question CUDA Programming and Performance	5	675	April 4, 2017
Warp shuffle instruction not working as expected CUDA Programming and Performance	7	784	September 6, 2023
Memory problem? ...incredible slowdown CUDA Programming and Performance	29	16290	January 30, 2011