Is “mad24” the integer multiply-add instruction on CUDA?
My code contains many “a*b+c” expressions (with int a, b, c), but why can’t I find “mad24” in the generated PTX?
My GPU is a GTX 260.
How can I use an integer multiply-add operation on CUDA to speed up my app?
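To make the question concrete, here is a minimal sketch of the two forms being compared. The kernel name, array names, and sizes are made up for illustration and are not from my real code. The first output uses the plain “a*b+c” expression; the second uses the `__mul24` intrinsic, which multiplies only the low 24 bits of each operand, so it should give the same result only when the values fit in 24 bits. My understanding is that the compiler cannot assume that for a plain int multiply, which may be why “mad24” does not show up; whether the second form actually becomes a “mad24” on a given device would need to be checked in the output of `nvcc -ptx`.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: computes a*b+c twice, once with the plain 32-bit
// expression and once with the 24-bit multiply intrinsic.
__global__ void madBothForms(const int *a, const int *b, const int *c,
                             int *outPlain, int *out24, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        // Plain form: operands are treated as full 32-bit integers.
        outPlain[i] = a[i] * b[i] + c[i];
        // 24-bit form: only valid if a[i] and b[i] fit in 24 bits (assumption).
        out24[i] = __mul24(a[i], b[i]) + c[i];
    }
}

int main()
{
    const int n = 256;
    const size_t bytes = n * sizeof(int);
    int ha[n], hb[n], hc[n], hPlain[n], h24[n];

    // Small test values, all well inside the 24-bit range.
    for (int i = 0; i < n; ++i) { ha[i] = i; hb[i] = 3 * i; hc[i] = 7; }

    int *da, *db, *dc, *dPlain, *d24;
    cudaMalloc((void **)&da, bytes);
    cudaMalloc((void **)&db, bytes);
    cudaMalloc((void **)&dc, bytes);
    cudaMalloc((void **)&dPlain, bytes);
    cudaMalloc((void **)&d24, bytes);

    cudaMemcpy(da, ha, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(db, hb, bytes, cudaMemcpyHostToDevice);
    cudaMemcpy(dc, hc, bytes, cudaMemcpyHostToDevice);

    madBothForms<<<(n + 127) / 128, 128>>>(da, db, dc, dPlain, d24, n);

    cudaMemcpy(hPlain, dPlain, bytes, cudaMemcpyDeviceToHost);
    cudaMemcpy(h24, d24, bytes, cudaMemcpyDeviceToHost);

    // Compare both device results against the host computation.
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        int expected = ha[i] * hb[i] + hc[i];
        if (hPlain[i] != expected || h24[i] != expected) ++mismatches;
    }
    printf("mismatches: %d\n", mismatches);

    cudaFree(da); cudaFree(db); cudaFree(dc); cudaFree(dPlain); cudaFree(d24);
    return mismatches != 0;
}
```

Compiling this with `nvcc -ptx` and searching the output for “mad24” or “mul24” would show which instructions each line turns into on a particular toolkit and target.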