Why is shift slower than integer multiply?

Is “mad24” the multiply-add instruction for integers on CUDA?

There are lots of “a*b+c” expressions (int a, b, c) in my code, but why can't I find “mad24” in the generated PTX?

My GPU is a G260.

How can I use the integer multiply-add operation on CUDA to speed up my app?
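For reference, here is a minimal sketch of what I understand the usual approach to be. A plain “a*b+c” on int is a full 32-bit multiply-add, so the compiler cannot assume the operands fit in 24 bits and will not pick mad24 on its own. CUDA does provide the __mul24 intrinsic (and __umul24 for unsigned), and when its result feeds an add, the compiler may fuse the two into mad24.lo.s32, though that is not guaranteed. The kernel name mul24_add_kernel and the host helper run_mul24_add are made up for illustration, and the result is only correct when a and b each fit in 24 bits.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel: out[i] = a[i] * b[i] + c[i], with the multiply done by
// the 24-bit intrinsic. Assumes every a[i] and b[i] fits in a signed 24-bit value.
__global__ void mul24_add_kernel(const int *a, const int *b, const int *c,
                                 int *out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        // __mul24 returns the low 32 bits of the 24-bit x 24-bit product.
        // The add that follows is a candidate for fusion into mad24.lo.s32.
        out[i] = __mul24(a[i], b[i]) + c[i];
    }
}

// Hypothetical host helper: copies the inputs to the device, runs the kernel,
// and copies the result back. Returns true if allocation and the final copy succeed.
bool run_mul24_add(const int *a, const int *b, const int *c, int *out, int n)
{
    size_t bytes = (size_t)n * sizeof(int);
    int *d_a = nullptr, *d_b = nullptr, *d_c = nullptr, *d_out = nullptr;

    bool ok = cudaMalloc(&d_a, bytes) == cudaSuccess &&
              cudaMalloc(&d_b, bytes) == cudaSuccess &&
              cudaMalloc(&d_c, bytes) == cudaSuccess &&
              cudaMalloc(&d_out, bytes) == cudaSuccess;

    if (ok) {
        cudaMemcpy(d_a, a, bytes, cudaMemcpyHostToDevice);
        cudaMemcpy(d_b, b, bytes, cudaMemcpyHostToDevice);
        cudaMemcpy(d_c, c, bytes, cudaMemcpyHostToDevice);

        int threads = 256;
        int blocks = (n + threads - 1) / threads;
        mul24_add_kernel<<<blocks, threads>>>(d_a, d_b, d_c, d_out, n);

        // This blocking copy also waits for the kernel to finish.
        ok = cudaMemcpy(out, d_out, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }

    // cudaFree on a null pointer is a no-op, so this is safe on partial failure.
    cudaFree(d_a);
    cudaFree(d_b);
    cudaFree(d_c);
    cudaFree(d_out);
    return ok;
}
```

To see which instruction was actually emitted, compile with nvcc -ptx and search the output for mad24 or mul24. As far as I know from the CUDA programming guide, the 24-bit path is only the faster one on compute capability 1.x devices; on later architectures the ordinary 32-bit multiply is native and __mul24 can be slower, so it is worth measuring before changing every expression.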