What’s the reason behind the restriction
incx != 0 in
It’s obvious that output vectors must have
inc* != 0 (e.g.
cublas*axpy functions). But for input vectors
incx == 0 can be used for passing vectors with constant values without array allocation.
And, for instance, we can use
cublas*gemv to calculate sum of matrix rows/columns.
And it’s not obvious what lead to such restrictions, because code like
x + i * incx is gonna work anyway. May be these restrictions can be lifted without changes in the kernel code?