How to perform a fast image warp?

What would be the best way to perform a fast affine image warp in CUDA?

Using a 2D texture. See the “simpleTexture” sample in the SDK.