The CUFFT library manual does not elaborate much on this. It only says it's for doing multiple FFTs in parallel. Does that mean multiple FFTs on the exact same dataset? Or on different parts of the input array?
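A single execution of a batched 1D plan transforms N vectors, one transform per vector in the batch, so each FFT works on a different part of the input array. The stride between vectors is currently always zero (it is not adjustable yet), that is, the vectors are packed back to back.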