Hardware accelerated vector operations?

jesusgumbau · May 1, 2009, 9:39am

I use lots of dot() functions in a CUDA kernel. However, I've seen that they are simply expanded inline, so that the dot() function is just translated as: x*x+y*y+z*z.

I would like to know whether there is any function to perform native dot products on the GPU (without having to perform 3 muls and 2 sums), since the GPU is capable of it (thinking about shaders).

Greetings.

Simon_Green · May 1, 2009, 10:29am

See the FAQ, Q32:
http://forums.nvidia.com/index.php?showtopic=84440

In short, no, current NVIDIA GPUs are scalar within each thread, although you can think of them as vector (SIMD) across the warp.
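To illustrate the point, here is a rough sketch (not taken from the FAQ) of how a per-thread dot product might be written. The names dot3 and dotKernel are made up for this example, and the use of fmaf() and managed memory is an assumption about a reasonably recent toolkit. Each thread computes one scalar dot product, and the "vector" width comes from the threads of a warp running the same instruction together.

```cuda
#include <cstdio>
#include <cmath>
#include <cuda_runtime.h>

// Hypothetical helper: 3-component dot product written as one multiply
// followed by two multiply-adds. There is no single dot-product
// instruction per thread; this is still scalar arithmetic.
__host__ __device__ inline float dot3(float3 a, float3 b)
{
    return fmaf(a.x, b.x, fmaf(a.y, b.y, a.z * b.z));
}

// One thread per pair of vectors. The threads of a warp execute the
// same scalar instruction on different elements.
__global__ void dotKernel(const float3* a, const float3* b, float* out, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        out[i] = dot3(a[i], b[i]);
    }
}

int main()
{
    const int n = 256;
    float3 *a = nullptr, *b = nullptr;
    float *out = nullptr;

    // Managed memory keeps the sketch short (assumes CUDA 6 or later).
    if (cudaMallocManaged(&a, n * sizeof(float3)) != cudaSuccess ||
        cudaMallocManaged(&b, n * sizeof(float3)) != cudaSuccess ||
        cudaMallocManaged(&out, n * sizeof(float)) != cudaSuccess) {
        std::fprintf(stderr, "allocation failed\n");
        return 1;
    }

    for (int i = 0; i < n; ++i) {
        a[i] = make_float3(1.0f, 2.0f, 3.0f);
        b[i] = make_float3(4.0f, 5.0f, 6.0f);
    }

    dotKernel<<<(n + 127) / 128, 128>>>(a, b, out, n);
    if (cudaDeviceSynchronize() != cudaSuccess) {
        std::fprintf(stderr, "kernel failed\n");
        return 1;
    }

    std::printf("out[0] = %f\n", out[0]);

    cudaFree(a);
    cudaFree(b);
    cudaFree(out);
    return 0;
}
```

Writing the expression as plain a.x*b.x + a.y*b.y + a.z*b.z should usually give similar machine code, since nvcc contracts multiply-add pairs by default, so the explicit fmaf() calls are mostly there to make the instruction count visible.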
